Simultaneous production of photon pairs.

Spontaneous parametric down conversion is a quantum photonic process that 
produces pairs of photons having energy and momentum that are precisely 
determined by the nonlinear optical environment in which this production takes 
place. Each photon of a pair is emitted along the surface of a cone. In this 
photograph, different photon pairs are color-coded. The circles correspond to the 
cross section of the emission cone for each photon. For the pair in green, the
energy of each photon is the same, but their polarizations are orthogonal. Where 
the cones intersect, the polarization of each photon is a superposition of states, 
and the photons are said to be entangled. Such photons are the workhorses of 
numerous quantum photonic experiments and applications. See Chapter 5 for a 
complete presentation. (Reproduced by the kind permission of the Physics 
Faculty of the University of Vienna, © Faculty for Physics, University of 
Vienna. Picture credit: Paul Kwiat & Michael Reck) 





Preface 


Electrons and photons are fundamental particles that we encounter in our daily 
lives in plain view inside and outside the laboratory. Photonics treats the 
interaction of electrons and photons in the laboratory and in devices such as 
lasers and photodetectors. Quantum photonics is a textbook and reference for the 
behavior of electrons and photons in an environment that puts their quantum 
mechanical behavior in the foreground. 

Following the conjecture of Louis de Broglie, we ascribe to all elementary 
particles both particle and wave behavior. Two important manifestations of 
quantum behavior are tunneling and interference. A single electron or single 
photon can be made to interfere with itself by causing the wave function to 
follow two different trajectories, followed by recombination of the wave 
function at an observation point. 

Photon tunneling can be observed by pressing your finger against the outside 
of a partially filled water glass, making it possible to see your finger through the 
water, an example of frustrated total internal reflection. Observation of electron 
tunneling requires smaller structures with dimensions similar to those of an 
electron wavelength. 

When a single electron encounters a tunneling structure, its quantum wave 
function is separated into two separate spatial regions, representing the 
probabilities for transmission through or reflection from the structure. When a 
single photon encounters a beam splitter, its quantum wave function is similarly 
separated into two spatial regions, representing probabilities for transmission and 
reflection. The effect in both cases is the delocalization of the particle wave. In 
the case of the photon traveling at the speed of light, the delocalization can be 
present over very large distances. However, the detection of the quantum occurs 
at a single well-defined location in space, even though components of its wave 
function may be widely separated. 

Understanding the non-local behavior of electron and photon quanta is the 
theme of this textbook. Achieving control and deriving benefit from this effect
is an important frontier of optics + electronics = photonics. An early observation 
of electron interference and tunneling with technological importance is the 
Josephson effect, which is the basis of the superconducting quantum interference 
device, or SQUID. This device enables precision measurement and control of 
magnetic field in NMR imaging scanners. The development of the quantum 
cascade laser represents the control and exploitation of electrons and photons in 
a quantum tunneling device. The manipulation of delocalized single photons is
the basis of the emerging technology of secure communications via quantum
cryptography.

These technologies represent a departure from the evolution of classical 
optics and semiconductor technologies. The world of quantum photonics is both 
radically different and largely unexplored. The goal of this text is to provide an 
introduction and foundation for those seeking to make the next contributions. 

Thomas P. Pearsall
Paris, France

1. Electrons

Thomas P. Pearsall
Paris, France
Email: pearsall@ieee.org

Abstract 

Electrons are wave-particles with mass, charge and magnetic moment. An 
electron carries the unit charge of the electro-magnetic field. An electron can 
have a wide range of kinetic energies, depending on its velocity. It can be 
accelerated, in which case it also radiates photons. Electrons can interact with 
matter through mechanical impact or electro-magnetic influence. Only twenty-seven
years separate the discovery of the electron as a particle carrying electric
charge and the revolutionary proposal that it also displays wave-like properties. 
The electron is thus a fundamental particle with both wave and particle aspects. 
The Pauli exclusion principle says that no two electrons can occupy the same 
state (same place, same time, same state with the same magnetic moment), and 
we derive the Fermi-Dirac statistical distribution, which gives the probability 
that a state can be occupied by an electron. The wavelength-momentum
relationship that follows from de Broglie’s hypothesis leads naturally to the
Schrödinger wave equation, which carries important information concerning the
energy and momentum of the electron.


1.1 Introduction 

The first two chapters of this book introduce the important properties of 
electrons and photons. These are fundamental, indivisible quantum particles that
are also common to our everyday experience in the macroscopic world. For
example, electrons and photons work together to produce lasers, LEDs, full
color displays and cameras.

Electrons and photons display both wave-like and particle-like behavior. 
Electrons are wave-particles with mass. An electron carries the unit charge of the 
electro-magnetic field. An electron can have a wide range of kinetic energies, 
depending on its velocity. It can be accelerated, in which case it also radiates 
photons. Electrons can interact with matter through mechanical impact or 
electro-magnetic influence. Photons have no mass and move with the speed of 
light. A photon is a quantum counter of the excitation of the electro-magnetic 
field. A photon can thus be created (absorbing energy). Once created, a photon
has a fixed energy that is proportional to its frequency. A photon can be 
destroyed (releasing all of its energy). Under special circumstances, a pair of 
photons can experience interference of their mutual wavefunctions. Otherwise, 
photons do not interact with each other. 

The electron is an elementary particle characterised by its mass
($m_e = 9 \times 10^{-31}$ kg), charge ($q = 1.6 \times 10^{-19}$ C) and magnetic
moment ($\mu_e = -0.928 \times 10^{-23}$ J T$^{-1}$).

Here is a brief history: 

Shortly after the big bang, when the universe had cooled quite a bit, stable 
electron particles first appeared in our universe. 

1897: J.J. Thomson deduces the presence of the electron from studies of 
light radiated by a gas undergoing an electric discharge. He uses this 
apparatus to measure the mass-to-charge ratio of the electron by measuring 
the deflection of a beam of electrons in a magnetic field. Thomson wins the 
Nobel Prize in 1906. 

1909: R. Millikan measures the charge of an electron in a classic 
experiment. By balancing the gravitational force on an oil drop by the 
opposing attraction of an electric field, Millikan shows that the attraction is 
quantized in units of electronic charge. Millikan wins the Nobel Prize for 
this work in 1923. 

1913: Niels Bohr proposes that electrons circulate in stable circular orbits 
having well-defined energies around a compact and massive nucleus. Bohr 
is awarded the Nobel Prize in 1922. 

1914: James Franck and Gustav Hertz demonstrate that the energies of 
electrons in an atom are quantized. Einstein is reported to have remarked, 
“This work is so beautiful, it makes you cry.” They receive the Nobel Prize
in 1926.


1915: A. Sommerfeld develops the Bohr-Sommerfeld model of the atom, 
introducing elliptical orbits giving quantization of angular momentum 
within a single energy level. Sommerfeld does not win a Nobel Prize.
However, four of Sommerfeld’s PhD students, Werner Heisenberg,
Wolfgang Pauli, Peter Debye, and Hans Bethe, and three of Sommerfeld’s
post-docs, Linus Pauling, Isidor Rabi and Max von Laue, went on to win
Nobel Prizes.

1924: L. de Broglie proposes that electrons in motion have a wavelength, 
and that stable electron orbits have a circumference equal to an integral 
number of such wavelengths. He claims credit for explaining the physical 
basis of the quantization of angular momentum of electrons in atoms 
proposed by Bohr and Sommerfeld. This idea is quickly generalized to 
show that all particles have a wavelength equal to Planck’s constant divided 
by the momentum of the particle. The de Broglie wavelength is the first 
crude estimate of the “size” of an electron, and it is seen that this “size” is 
variable, depending on its momentum (and thus, its energy) relative to the 
observer. De Broglie wins the Nobel Prize in 1929. 

1925: Wolfgang Pauli proposes the exclusion principle in order to explain 
why the electrons of an atom were not all in the lowest possible energy 
level. This principle says that no two electrons can occupy the same 
quantum state. As the atomic number rises, additional electrons may 
populate only the unoccupied states at higher energies. Pauli had to wait 
until 1945 to receive the Nobel Prize for this contribution. 

1925: Clinton Davisson and his former PhD student Lester Germer at Bell 
Labs in New York City directly observe angle-dependent diffraction of 
electrons from the surface of crystalline nickel. Electron microscopy is 
born. Only Davisson gets the Nobel Prize in 1937. 

1926: Fermi-Dirac statistics and Fermi level. Fermi wins the Nobel Prize in
1938 for his work on nuclear reactions induced by slow neutrons, which
paved the way to nuclear fission, the basis of the atomic bomb and nuclear
power. Dirac shared the 1933 Nobel Prize with Schrödinger for new
quantum mechanical theory.

1973: Hans Dehmelt uses a molecular trap developed independently by 
Wolfgang Paul that permits a single electron to be isolated and studied. 
Dehmelt and Paul share the Nobel Prize in 1989. In his Nobel lecture, 
Dehmelt quoted a phrase from Einstein, “You know, it would be sufficient
to really understand the electron.”


1.2 de Broglie’s Second Idea 

Louis de Broglie was born in 1892 to a French noble family. His older brother 
Maurice, born in 1875, was an accomplished experimental physicist who was 
invited to attend the first Solvay Conference in 1911, often celebrated as the 
starting point for quantum physics (see Fig. 1.1). Louis was a theorist and 
worked on his doctoral thesis in the early 1920s at the University of Paris. Leon 
Brillouin was a classmate and colleague of Louis during these years. Brillouin’s 
father, Marcel, also participated in the 1911 Solvay Conference. Thus, de Broglie
and Brillouin were steeped in and inspired by the quantum revolution which was 
taking off at the same time as their graduate studies. 



Fig. 1.1 The first Solvay Conference, held in the Hotel Metropole in Brussels (see
http://en.wikipedia.org/wiki/Solvay_Conference). Marcel Brillouin, the father of Leon Brillouin, is seated
2nd from the left, next to Ernest Solvay, the wealthy Belgian industrialist who thought up the idea of a
conference of the most brilliant scientists of his day. Maurice de Broglie, the older brother of Louis, is
standing 6th from the left. Albert Einstein, standing 2nd from the right, attended this as well as many
subsequent editions of this conference. (Photograph by Benjamin Couprie, 1911, public domain,
https://commons.wikimedia.org/wiki/File:1911_Solvay_conference.jpg)


De Broglie defended his thesis in late November of 1924. The cover page is
shown in Fig. 1.2.

Fig. 1.2 Cover page for the doctoral thesis of Louis de Broglie. Each doctoral candidate had to write on
two subjects: one chosen by the candidate, and one assigned by the faculty. The title of his chosen subject
is: “Research on the Theory of Quanta”


The thesis is short, about 100 pages in all. Almost all of the chapters are 
concerned with the effect of special relativity on the properties of various 
fundamental particles such as the energy and phase of a propagating light beam. 

In Chap. 3 there is an abrupt change of subject, and de Broglie addresses the
hypothesis developed by Bohr to explain the existence of discrete atomic energy
levels. Eleven years earlier, Bohr proposed that the electrons in atoms traveled in
stable orbits, thus allowing atoms to have long lifetimes, an experimental truth
we all recognize. The condition originally proposed by Bohr was:

$$m_0 \omega R^2 = n\frac{h}{2\pi} \tag{1.1}$$

where $m_0$ is the mass of the electron, $\omega$ the angular frequency of rotation
around the atom, and $R$ the radius of its orbit. For a circular orbit, $\omega = v/R$ and
Bohr’s condition becomes:

$$m_0 v R = n\frac{h}{2\pi} \tag{1.2}$$

where $v$ is the linear speed of the electron particle.

This has the simple interpretation that the angular momentum of the electron
($= m_0 v R$) is quantized in units of h-bar, $\hbar = h/2\pi$. However, in 1924 there
was no idea about why this quantization occurred, or what properties of the
electron assured this behavior.

On page 44 of his thesis, de Broglie offered an interpretation that was 
consistent with his everyday experience: the Bohr condition was similar to the 
behavior of waves of water in a closed circular tank. Stable states occur when 
there are standing waves. The condition for the existence of a standing wave is 
that the length of the circuit be an integral number of wavelengths of the 
standing wave. There are only certain fixed lengths of the canal that can support
standing waves. The possible canal lengths are given by the relationship $L = n\lambda$.

The argument of de Broglie contains no equations. It appears on page 44 of his
thesis and is shown in Fig. 1.3.

Translation of the highlighted portion:

"The propagation (of the electron) is therefore analogous to that of a wave of liquid in a tank
that forms a closed path. In order to have a stable condition for the wave, it is physically
evident that the length of the tank must be in resonance with the wave. In other words, the
portions of the wave that are located a full length $l$ of the tank behind the preceding portion
of the wave must be in phase with the preceding portion. The condition for resonance is $l = n\lambda$."

Fig. 1.3 The proposition by de Broglie in his thesis: the stable orbits of electrons in atoms are like
stationary waves of water in a closed circular tank


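If we substitute the resonance condition of de Broglie into (1.2) (remember
that $\hbar = h/2\pi$, and that the length of the circular path is $2\pi R = n\lambda$), we get:

$$m_0 v\,(n\lambda) = nh \quad\Longrightarrow\quad m_0 v = \frac{h}{\lambda} \tag{1.3}$$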


Equation (1.3) says that the electron has momentum that is inversely 
proportional to its wavelength. This simple conclusion does not appear in de 
Broglie’s thesis, nor does the extension of this result to free electrons or other 
particles like photons. However, de Broglie let the cat out of the bag so to speak,
for which he was awarded the Nobel Prize in 1929. He claimed credit in his
thesis for “the first plausible physical explanation for the condition of stable
orbits as proposed by Bohr and Sommerfeld.” The principal result of his idea
was to open the way for the development of Schrödinger’s wave equation, and
the first quantitative description of the behavior of electrons and atoms.

I find the most interesting part of de Broglie’s reasoning to be the notion
that because quantization exists, there must be an associated wave behavior.

In practice it is quite difficult to measure the momentum of an electron, but
quite easy to determine its energy. The wavelength of an electron (or any other
particle-wave) is expressed in terms of its energy and mass as:

$$\lambda = \frac{h}{\sqrt{2mE}} \tag{1.4}$$


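Example 1.1

The wavelength of an electron accelerated through a potential difference of 1 V,
which gives it a kinetic energy of 1 eV, is:

$$1\ \text{eV} = 1.6 \times 10^{-19}\ \text{J}$$

$$\lambda = \frac{6.6 \times 10^{-34}\ \text{J s}}{\sqrt{2 \cdot 9 \times 10^{-31}\ \text{kg} \cdot 1.6 \times 10^{-19}\ \text{J}}} \approx 1.2 \times 10^{-9}\ \text{m}$$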


The de Broglie wavelength associated with a particle is variable and depends on
the relative difference in energy state between the observer and the particle in
question. For example, if you were able to travel alongside the 1 eV electron at
the same speed (about $6 \times 10^{2}$ km s$^{-1}$, a small fraction of the speed of
light), the electron would appear to be nearly stationary, and its wavelength
would be very long. Different observers will attribute different wavelengths to
the same electron depending on their relative energy difference.
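The numbers quoted for the 1 eV electron can be checked with a few lines of Python. This is a minimal sketch, not part of the original text: the function names are illustrative, and the constants are rounded textbook values, so the results agree with Example 1.1 only to two or three significant figures.

```python
import math

# Rounded physical constants (assumed values, SI units)
H = 6.626e-34      # Planck's constant, J s
M_E = 9.109e-31    # electron rest mass, kg
EV = 1.602e-19     # one electron-volt expressed in joules


def de_broglie_wavelength(energy_ev, mass_kg=M_E):
    """Non-relativistic wavelength in meters from kinetic energy, Eq. (1.4)."""
    return H / math.sqrt(2.0 * mass_kg * energy_ev * EV)


def speed(energy_ev, mass_kg=M_E):
    """Non-relativistic speed in m/s from kinetic energy E = m v^2 / 2."""
    return math.sqrt(2.0 * energy_ev * EV / mass_kg)


wavelength_1ev = de_broglie_wavelength(1.0)
speed_1ev = speed(1.0)

print(f"wavelength = {wavelength_1ev * 1e9:.2f} nm")
print(f"speed      = {speed_1ev / 1e3:.0f} km/s")
```

Because the wavelength scales as the inverse square root of the energy, a 100 eV electron in the same sketch has a wavelength ten times shorter.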

Equation (1.4) works both ways. If a particle is confined physically in space
by potential barriers, then this puts an upper limit on the wavelength that the
particle can have. In turn this means that the energy of the particle is increased:

$$E = \frac{p^2}{2m} = \frac{h^2}{2m\lambda^2} = \frac{\hbar^2 k^2}{2m}, \quad \text{where } p = \frac{h}{\lambda} \text{ and } k = \frac{2\pi}{\lambda} \tag{1.5}$$

From the macroscopic point of view, such a particle would appear to have a
net momentum of zero. However, the particle “motion” consists of two
counter-propagating de Broglie waves, each having momentum of magnitude $p = h/\lambda$.
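To get a feeling for the size of this confinement energy, the sketch below evaluates (1.5) for an electron. It assumes, for illustration only, that the longest wavelength that fits between two barriers a distance `well_width_m` apart is twice that distance (one half-wavelength across the well). The function name and the 1 nm width are hypothetical choices, not values from the text.

```python
# Rounded physical constants (assumed values, SI units)
H = 6.626e-34      # Planck's constant, J s
M_E = 9.109e-31    # electron rest mass, kg
EV = 1.602e-19     # one electron-volt expressed in joules


def confinement_energy_ev(well_width_m, mass_kg=M_E):
    """Minimum kinetic energy in eV from Eq. (1.5), E = h^2 / (2 m lambda^2).

    Assumption: the longest allowed wavelength is twice the well width.
    """
    longest_wavelength = 2.0 * well_width_m
    energy_joule = H**2 / (2.0 * mass_kg * longest_wavelength**2)
    return energy_joule / EV


energy_1nm = confinement_energy_ev(1.0e-9)
energy_half_nm = confinement_energy_ev(0.5e-9)

print(f"1.0 nm well: {energy_1nm:.3f} eV")
print(f"0.5 nm well: {energy_half_nm:.3f} eV")
```

Halving the width of the confining region quadruples the energy, which is the inverse-square dependence on wavelength in (1.5).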


The de Broglie wavelength applies to any particle including atoms and
molecules, even to photons. If we apply the relationship to Planck’s equation, for
example,

$$E = h\nu = \frac{h}{\lambda}c = pc \tag{1.6}$$

where $p$ is the momentum of the photon, and $c$ is the speed of light.










Louis de Broglie and Leon Brillouin (who studied the motion of electron
waves in a periodic environment like a crystal) both attended the fifth Solvay
Conference, held in Brussels in 1927. Nearly 20 years after the first
conference, Mr. Solvay had now built a beautiful library in the Leopold Park in
Brussels. The participants’ photo is taken outside this building (see Fig. 1.4).
They represent in a large part a new generation of physicists whose work would
revolutionize the understanding and analysis of quantum phenomena.



Fig. 1.4 The Solvay Conference of 1927. The conference topic was electrons and photons. This meeting
could be considered to mark the beginning of Quantum Photonics. Two-thirds of the participants would
eventually win the Nobel Prize. While there are still a few people who attended the 1st Solvay conference,
notably Curie, Einstein and Bohr, the participants here for the first time represent a new generation of
physicists, who developed modern quantum mechanics. Among them are Louis de Broglie, brother of
Maurice, and Leon Brillouin, son of Marcel. (Photograph by Benjamin Couprie, 1927, public domain,
https://commons.wikimedia.org/wiki/File:1927_Solvay_conference.jpg) (Back row L-R: A. Piccard, E.
Henriot, P. Ehrenfest, E. Herzen, T. de Donder, E. Schrödinger, E. Verschaffelt, W. Pauli, W. Heisenberg,
R.H. Fowler, L. Brillouin. Middle row L-R: P. Debye, M. Knudsen, W.L. Bragg, H.A. Kramers, P.A.M.
Dirac, A.H. Compton, L.V. de Broglie, M. Born, N. Bohr. Front row L-R: I. Langmuir, M. Planck, M.
Curie, H.A. Lorentz, A. Einstein, P. Langevin, C.E. Guye, C.T.R. Wilson, O.W. Richardson)


1.3 The Pauli Exclusion Principle 

The year is 1925, and the physics community has been stunned in 1924 by the
proposition of de Broglie on one hand, and on the other the experimental
verification of electron diffraction at Bell Labs by Davisson and Germer.
Electron waves are real and measurable.

An excellent description of the development of the Pauli exclusion principle 
has been given by Pauli himself: 

The history of the discovery of the «exclusion principle», for which I have
received the honor of the Nobel Prize award in the year 1945, goes back to
my student days in Munich... It was at the University of Munich that I was
introduced by Sommerfeld to the structure of the atom - somewhat strange
from the point of view of classical physics. The series of whole numbers 2,
8, 18, 32, giving the lengths of the periods in the natural system of chemical
elements, was zealously discussed in Munich, including the remark of the
Swedish physicist, Rydberg, that these numbers are of the simple form $2n^2$,
if $n$ takes on all integer values.

A new phase of my scientific life began when I met Niels Bohr
personally for the first time. This was in 1922, when he gave a series of
guest lectures at Göttingen, in which he reported on his theoretical
investigations on the Periodic System of Elements. The question, as to why
all electrons for an atom in its ground state were not bound in the innermost
shell, had already been emphasized by Bohr as a fundamental problem in
his earlier works. In his Göttingen lectures he treated particularly the
closing of this innermost K-shell in the helium atom and its essential
connection with the two non-combining spectra of helium, the ortho- and
para-helium spectra. However, no convincing explanation for this
phenomenon could be given on the basis of classical mechanics. It made a
strong impression on me that Bohr at that time and in later discussions was
looking for a general explanation which should hold for the closing of every
electron shell and in which the number 2 was considered to be as essential
as 8.

Excerpts from the Nobel Lecture by Wolfgang Pauli, December 13, 1946.

Pauli based his exclusion principle on experimental studies of the splitting of
atomic spectra in magnetic fields. These experiments show the number of states
in each electron shell. However, he was baffled by the factor 2 in the Rydberg
formula $2n^2$. To solve this he also proposed a new quantum number, which was
later identified by Goudsmit and Uhlenbeck as an intrinsic magnetic moment
called spin. With the aid of this concept, Pauli could propose that each known
state for electrons in an atom could be occupied by at most one electron. The
principal result of this idea is that atomic states are filled successively by
electrons, each having a unique combination of 4 quantum numbers.

Electron spin is only one example of how Pauli looked for the “missing link”
in physics. By 1930, he was studying the particle tracks of beta-decay, trying to
understand the apparent violations of energy and momentum conservation. In a
letter to the attendees of a physics conference in Tübingen, Germany, he
proposed a “desperate remedy”: the existence of a new neutral particle, nearly
impossible to detect, as a solution. As before, he did not give a name to the
proposed quantum. Enrico Fermi thought up the name neutrino shortly
thereafter. The neutrino group is today a keystone of the standard model, and the
subject of the Nobel Prize awarded in 2015.


1.4 Fermi-Dirac Statistics 

The Pauli exclusion principle means that no two electrons can be found in the 
same place at the same time with the same quantum numbers. This feature is a
fundamental difference between electrons and photons. Two photons having
identical properties of frequency and polarization are indistinguishable. They can 
be made to occupy the same space at the same time, leading to interference 
between them. Such interference is not possible for electrons. 

The fundamental difference between the behavior of electrons and that of 
photons can be seen on a macroscopic level as well as on the quantum level. 
Current in a photonic device is carried by electrons, typically in such large 
numbers that their behavior is better described by statistics of the ensemble 
rather than by following the trajectory of individual quanta. Any given material,
such as a silicon crystal, contains electrons. Under equilibrium conditions, these
electrons occupy stable states in the material. Using principles of statistical
mechanics, we can determine the probability that a given state, having a
corresponding energy, is occupied by an electron. The Fermi energy is the 
energy at which the probability is 50% that a state is occupied. In an ideal 
semiconductor, for example, the Fermi level lies in the forbidden gap that 
separates the valence band, where the majority of states are occupied by 
electrons, from the conduction band where the majority of states are unoccupied. 

The Fermi-Dirac statistical distribution is based on the fundamental quantum 
properties of electrons. It is used to characterize the behavior of electrons in 
matter, including the current-voltage relationship in macroscopic electronic 
devices like transistors, and photonic devices like lasers and LEDs. 

In a statistical ensemble, each state has a probability of occupation $P$ given
by:

$$P = e^{(\Omega + \mu N - E)/kT} \tag{1.7}$$

$N$ number of particles in the state in question
$\mu$ chemical potential
$E$ energy of the state
$k$ Boltzmann’s constant
$T$ absolute temperature

$\Omega$ is the grand potential:

$$\Omega = -kT \ln \sum_{N} e^{(\mu N - E)/kT} \tag{1.8}$$


For the case of an energy level that can be filled by an electron, using the Pauli
exclusion principle, there are two possibilities: either the state is unoccupied
($N = 0$, $E = 0$), or it is occupied by 1 electron ($N = 1$). As a result,

$$\Omega = -kT \ln\left(1 + e^{(\mu - E)/kT}\right) \tag{1.9}$$

The occupation probability density as a function of energy is given by

$$n(E) = -\frac{\partial \Omega}{\partial \mu} = \frac{1}{e^{(E - \mu)/kT} + 1} \tag{1.10}$$

We can now choose a specific chemical potential $\mu$ as a reference energy, and
we call this the Fermi level, $E_F$. The Fermi energy is the energy at which the
probability that a state is occupied is equal to ½.

$$n(E) = \frac{1}{e^{(E - E_F)/kT} + 1} \tag{1.11}$$

The statistics for electrons were developed independently by Enrico Fermi
and Paul Dirac. Equation (1.11) is called the Fermi-Dirac distribution.
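The behavior of the Fermi-Dirac distribution is easy to explore numerically. The following is a minimal sketch under stated assumptions: the function name, the sample Fermi level of 0.5 eV and the temperature of 300 K are illustrative and do not come from the text, and Boltzmann's constant is a rounded value expressed in eV/K.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann's constant in eV/K (rounded, assumed value)


def fermi_dirac(energy_ev, fermi_level_ev, temperature_k):
    """Occupation probability n(E) = 1 / (exp((E - E_F)/kT) + 1)."""
    x = (energy_ev - fermi_level_ev) / (K_B_EV * temperature_k)
    if x > 700.0:
        # exp(x) would overflow a double; the occupation is zero to machine precision
        return 0.0
    return 1.0 / (math.exp(x) + 1.0)


# Illustrative values: Fermi level at 0.5 eV, room temperature
E_F = 0.5
T = 300.0
for energy in (0.3, 0.4, 0.5, 0.6, 0.7):
    print(f"E = {energy:.1f} eV  n(E) = {fermi_dirac(energy, E_F, T):.4f}")
```

At the Fermi level the occupation is exactly one half, and the probabilities at equal energy offsets above and below the Fermi level add up to one.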


1.5 The Schrödinger Equation

Following the experimental verification of de Broglie’s proposal that
electrons (and holes) have a characteristic wavelength, we can write down the
wave function of a one-dimensional free electron as a general wave as follows:

$$\Psi(x,t) = A e^{i(kx - \omega t)} = A e^{ikx} e^{-i\omega t} \tag{1.12}$$

The first spatial derivative of this wavefunction gives:

$$\frac{\partial}{\partial x}\Psi(x,t) = ik\,\Psi(x,t) \tag{1.13}$$

Multiplying both sides by $-i\hbar$:

$$-i\hbar\frac{\partial}{\partial x}\Psi(x,t) = \hbar k\,\Psi(x,t) = p\,\Psi(x,t) \tag{1.14}$$

where $p$ is the linear momentum associated with the wavefunction, following
de Broglie’s hypothesis. We define the momentum operator as:

$$p_{op} = -i\hbar\frac{\partial}{\partial x} \tag{1.15}$$
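The action of the momentum operator on a plane wave can be verified numerically. The sketch below is an illustration under stated assumptions rather than part of the original derivation: it approximates the derivative by a central finite difference, and the wavelength, sample position and step size are hypothetical values chosen only to make the check concrete.

```python
import cmath
import math

HBAR = 1.0546e-34  # reduced Planck constant, J s (rounded, assumed value)
M_E = 9.109e-31    # electron rest mass, kg (rounded, assumed value)


def plane_wave(x, t, k, omega, amplitude=1.0):
    """Free-electron wavefunction A * exp(i (k x - omega t))."""
    return amplitude * cmath.exp(1j * (k * x - omega * t))


def apply_momentum_operator(psi, x, t, dx):
    """Approximate -i * hbar * d(psi)/dx with a central difference."""
    dpsi_dx = (psi(x + dx, t) - psi(x - dx, t)) / (2.0 * dx)
    return -1j * HBAR * dpsi_dx


# Hypothetical sample values: wavelength close to that of a 1 eV electron
wavelength = 1.23e-9
k = 2.0 * math.pi / wavelength
omega = HBAR * k**2 / (2.0 * M_E)


def psi(x, t):
    return plane_wave(x, t, k, omega)


x0, t0 = 0.3e-9, 0.0
operator_result = apply_momentum_operator(psi, x0, t0, dx=1.0e-13)
expected = HBAR * k * psi(x0, t0)  # p * psi with p = hbar * k
relative_error = abs(operator_result - expected) / abs(expected)

print(f"relative error = {relative_error:.2e}")
```

The small residual error comes from the finite-difference step and shrinks as `dx` is reduced, which is consistent with the plane wave being an eigenfunction of the momentum operator with eigenvalue $\hbar k$.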

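Taking the 2nd derivative of the wavefunction gives:

$$\frac{\partial^2}{\partial x^2}\Psi(x,t) = -k^2\,\Psi(x,t) \tag{1.16}$$

Multiplying both sides by $-\hbar^2/2m^*$, where $m^*$ is the effective mass of the
electron (or hole):

$$-\frac{\hbar^2}{2m^*}\frac{\partial^2}{\partial x^2}\Psi(x,t) = \frac{\hbar^2 k^2}{2m^*}\Psi(x,t) = \frac{p^2}{2m^*}\Psi(x,t) = T\,\Psi(x,t) \tag{1.17}$$

where $T$ is the kinetic energy associated with the electron.
It is left as an exercise to show that:

$$i\hbar\frac{\partial}{\partial t}\Psi(x,t) = E\,\Psi(x,t) \tag{1.18}$$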

The principle of conservation of total energy states that

Kinetic energy + Potential energy = Total energy

Adding an external potential energy term $V(x,t)$ to (1.17) gives

$$-\frac{\hbar^2}{2m^*}\frac{\partial^2}{\partial x^2}\Psi(x,t) + V(x,t)\,\Psi(x,t) = E\,\Psi(x,t) = i\hbar\frac{\partial}{\partial t}\Psi(x,t) \tag{1.19}$$

When the potential does not depend on time, this differential equation is
separable in time and position, and we can solve for the spatial dependence of
the energies independently of the time evolution. This is easily verified by
applying the wave function of (1.12) to the LHS of (1.19).

$$-\frac{\hbar^2}{2m^*}\frac{d^2}{dx^2}\Psi(x) + V(x)\,\Psi(x) = E\,\Psi(x) \tag{1.20}$$

A similar procedure can be used to solve for the time evolution of the
wavefunction. The solutions of Schrödinger’s equation give stationary states for
which total energy is conserved.


1.6 de Broglie’s First Idea 

On the third page of his thesis, Louis de Broglie professed his belief that
Planck’s quantum relationship, $E = hf$, applied not only to photons, but also to
matter in general. Thus, a particle like an electron, with a rest-mass energy given
by Einstein’s relationship $E = m_0c^2$, would also have an innate frequency given
by $f_0 = m_0c^2/h$. He wrote:

The quantum relationship ($E = hf$) would probably not have much meaning
if its energy were distributed in a continuous way in space, and we have
seen that it is certainly not the case. We can therefore propose a grand law
of nature: each amount of energy associated with a mass $m_0$ is endowed
with a periodical behavior of frequency $f_0$ such that:

$$hf_0 = m_0c^2 \tag{1.21}$$

where $f_0$ is of course measured in the system appropriate to that amount of
energy.

This is a daring proposal, based on little more than intuition and dimensional 
equivalence. 

Does a particle have a measurable frequency that is proportional to its
mass? In the case of a non-relativistic electron, this frequency is about
$1.2 \times 10^{20}$ Hz. This frequency is more than 5 orders of magnitude too high to be
measured directly using today’s best instruments. Protons and atoms, of course,
would have an even higher de Broglie frequency. However, it is possible to
imagine an experiment that might be attempted using frequency-comb
techniques developed by John Hall and Theodor Hänsch, and recognized by a
Nobel Prize in 2005. This is the subject of the following example.
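As a quick numerical aside before the example, the magnitude of the de Broglie frequency follows directly from (1.21). This is a minimal sketch with rounded constants; the function name is illustrative, and the proton is included only as a stand-in for the heavier particles mentioned above.

```python
# Rounded physical constants (assumed values, SI units)
H = 6.626e-34           # Planck's constant, J s
C = 2.998e8             # speed of light, m/s
M_ELECTRON = 9.109e-31  # electron rest mass, kg
M_PROTON = 1.673e-27    # proton rest mass, kg


def de_broglie_frequency(rest_mass_kg):
    """Frequency f_0 = m_0 c^2 / h associated with a rest mass, Eq. (1.21)."""
    return rest_mass_kg * C**2 / H


f_electron = de_broglie_frequency(M_ELECTRON)
f_proton = de_broglie_frequency(M_PROTON)

print(f"electron: {f_electron:.3e} Hz")
print(f"proton:   {f_proton:.3e} Hz")
```

The proton value is larger than the electron value by the ratio of the two masses, roughly a factor of 1800.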

Example 1.2 

The following thought-experiment, diagrammed in Fig. 1.5, is based on an idea
developed by Dr. Thomas Udem, Professor at Ludwig-Maximilians-Universität,
Munich, Germany.

Fig. 1.5 Diagram of proposed experiment: a photon is emitted by a laser with frequency $f_{Laser}$ tuned to
the energy difference between two levels in an atom. To conserve momentum, the atom recoils with
momentum $hf_{Laser}/c$

An atom is excited by a laser and absorbs a single photon with frequency
$f_{Laser}$. By conservation of momentum, the atom recoils with momentum
$p = hf_{Laser}/c$.

The recoil energy is

$$hf_{Recoil} = \frac{p^2}{2m_0} = \frac{1}{2}\frac{h^2 f_{Laser}^2}{m_0c^2} \tag{1.22}$$

The recoil frequency would be, according to Planck’s law ($E = hf$):

$$f_{Recoil} = \frac{1}{2}\frac{h f_{Laser}^2}{m_0c^2} \tag{1.23}$$

Applying de Broglie’s first idea, that each quantity of matter has a frequency
$f_{Broglie} = m_0c^2/h$ determined by its mass, we substitute this quantity in (1.22):

$$f_{Recoil} = \frac{1}{2}\frac{f_{Laser}^2}{f_{Broglie}} \tag{1.24}$$

Both the laser frequency and the recoil frequency can be locked to a
reference frequency $f_{ref}$ generated by a frequency comb. This is the
measurement technique invented and developed by John Hall and Theodor
Hänsch.

$$f_{Laser} = n_1 f_{ref} \tag{1.25}$$

$$f_{Recoil} = n_2 f_{ref} \tag{1.26}$$

As a result:

$$f_{Broglie} = \frac{1}{2}\frac{n_1^2}{n_2} f_{ref} \tag{1.27}$$

Note that no fundamental constants appear in this expression. The de Broglie
frequency is determined by a ratio of $f_{Laser}^2$ and $f_{Recoil}$.

From (1.23), we can estimate

$f_{Laser} \approx 10^{15}$ Hz

$f_{Broglie} \approx 10^{24}$ Hz

$f_{Recoil} \sim 5 \times 10^{5}$ Hz

The de Broglie hypothesis would be established if the recoil frequency can 
be measured. In turn, this would establish equivalence between mass and time. A 
recently published experiment has claimed such a result (Lan et al. 2013). 
However, this claim is contested by some experts in the field (Luc Blanchet et al. 
2013). 
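The frequency relations of Example 1.2 can be checked with a short calculation. The sketch below is illustrative only: it uses the order-of-magnitude laser and de Broglie frequencies quoted in the example, and a hypothetical reference frequency of 100 kHz that is an assumption made here so that the multipliers come out as round numbers. It is not a value taken from any experiment.

```python
import math


def recoil_frequency(f_laser, f_broglie):
    """Recoil frequency from Eq. (1.24): f_Recoil = f_Laser^2 / (2 f_Broglie)."""
    return 0.5 * f_laser**2 / f_broglie


def broglie_frequency_from_comb(n1, n2, f_ref):
    """de Broglie frequency from Eq. (1.27): (1/2) (n1^2 / n2) f_ref."""
    return 0.5 * n1**2 / n2 * f_ref


# Order-of-magnitude inputs from the example
f_laser = 1.0e15    # Hz
f_broglie = 1.0e24  # Hz
f_recoil = recoil_frequency(f_laser, f_broglie)

# Hypothetical comb reference frequency (assumption for illustration)
f_ref = 1.0e5                # Hz
n1 = f_laser / f_ref         # multiplier in Eq. (1.25)
n2 = f_recoil / f_ref        # multiplier in Eq. (1.26)
recovered = broglie_frequency_from_comb(n1, n2, f_ref)

print(f"f_Recoil  = {f_recoil:.3e} Hz")
print(f"n1 = {n1:.3e}, n2 = {n2:.3e}")
print(f"f_Broglie = {recovered:.3e} Hz (recovered from n1, n2 and f_ref)")
```

The round trip recovers the de Broglie frequency from the two multipliers and the reference frequency alone, with no fundamental constants entering the calculation.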

De Broglie received the Nobel Prize for his second idea, about the 
wavelength that can be attributed to matter. The Nobel award followed the 
experimental verification of this effect. The de Broglie wavelength is not an 
intrinsic property. Rather, it depends on the relative energy of the particle with 
respect to the observer. Thus, the wavelength of an electron in an electron 
microscope depends on its velocity with respect to the target it is probing. The de 
Broglie frequency, on the other hand, would be an intrinsic property of an atom 
or electron, at least in the non-relativistic regime. While the report of Lan et al. 
makes a tantalizing claim, more proof is needed to establish the acceptance of de 
Broglie’s first idea. 


1.7 Summary 

The electron is a fundamental particle with both wave and particle aspects. The
Pauli exclusion principle says that no two electrons can occupy the same state
(same place, same time, same state with the same magnetic moment), and we
have derived the Fermi-Dirac statistical distribution, which gives the probability
that a state can be occupied by an electron. The wavelength-momentum
relationship that follows from de Broglie’s hypothesis leads naturally to the
Schrödinger wave equation that carries important information concerning the
energy and momentum of the electron.


Exercises 

1.1 a. What is the wavelength of a 1 eV photon? Prove that a free electron 

cannot interact with a free photon conserving energy and momentum. 

b. Calculate the wavelength of a 1 eV phonon in silicon. Comment on 
your result. 

1.2 Derive the time-dependent Schrödinger equation:

$$i\hbar\,\frac{\partial}{\partial t}\Psi(x,t) = E\,\Psi(x,t)$$

Show that the wavefunction can be expressed:

$$\Psi(x,t) = A e^{-iEt/\hbar}$$

where the allowed values of $E$ are identical to those determined from
the solutions of the time-independent Schrödinger equation.

1.3 An electron is localized in a one-dimensional potential well of width = L 
meters. Choose the value of the potential V = 0 at the bottom of the well 
and infinitely large outside the boundaries of the well. Fix one boundary of 
the well at x = 0 and the other boundary at x = L as diagrammed in Fig. 1.6. 



Fig. 1.6 1-dimensional infinite potential well, with $V = \infty$ outside the boundaries at $x = 0$ and $x = L$

Assume that the electron wavefunction can be expressed: 

$$\psi(x) = A\sin(kx) + B\cos(kx)$$

Show that the energies of the stable electron states are quantized and 
expressed by: 

$$E = \frac{n^2\pi^2\hbar^2}{2L^2 m}$$


1.4 Compare the Fermi distribution to the Boltzmann distribution. 

1.5 Determine the frequency and vacuum wavelength of a photon having the 
mass equivalent energy of an electron at rest. 
Abstract

The photon is an elementary particle characterized by its frequency and
polarization. The photon displays both wave-like and particle-like behavior. A
photon is the unit intensity of the electro-magnetic field. Once created, a photon
is characterized by its energy ($E = hf$) which remains unchanged until the photon
is destroyed. Maxwell’s equations describe the electromagnetic fields in the
classical regime, including propagation of photon beams as a function of 
frequency and polarization. Maxwell’s equations represent a staggering 
accomplishment in physical theory, but they do not give a complete description 
of photon behavior. Two photons with the same quantum numbers (that is the 
same frequency and polarization) can (unlike electrons) occupy the same place 
at the same time. The probability that an energy state is occupied by a photon is 
given by the Bose-Einstein distribution. A single photon is an indivisible 
elementary particle. Single photons can be produced by spontaneous parametric 
down-conversion. This is an optical process that cannot be described using
Maxwell’s equations. When a single photon is incident upon a beamsplitter, its 
wavefunction is separated spatially into two parts, corresponding to reflection 
and transmission. When a measurement is made, the entire photon will be 
detected either as 100% transmitted or 100% reflected. This result is an example 
of the Copenhagen interpretation of quantum mechanics. 


2.1 Introduction 




A photon is the unit intensity of the electro-magnetic field. Once created, a
photon is characterized by its energy ($E = hf$) which remains unchanged until the
photon is destroyed. A photon has wavelike properties, and so it has a
wavelength. While the energy or frequency is fixed, the wavelength of a photon 
can change depending on the index of refraction of the medium in which it is 
propagating. Photons, unlike electrons, do not interact with each other. They 
cannot exchange energy or momentum. 

Two photons with the same quantum numbers (that is the same frequency 
and polarization) can (unlike electrons) occupy the same place at the same time. 
In this situation, the photons are indistinguishable, and the respective 
wavefunctions of the photons can experience quantum interference. 

The optical beamsplitter is an instrument, for example a simple glass slide, 
that divides a photon beam into two different paths by partial reflection and 
partial transmission of the beam. When a single photon is incident upon a 
beamsplitter, its wavefunction is spatially separated into two parts, 
corresponding to reflection and transmission. However, a photon is an indivisible 
elementary particle. When a measurement is made, the entire photon will be 
detected either as 100% transmitted or 100% reflected. In the time and space 
interval between the beamsplitter and the detector, the photon is delocalized. The 
wavefunction is present in both paths, and these two components belonging to a 
single photon can be made to interfere with each other. 

In addition to reshaping the wavefunction of a single photon, the optical 
beamsplitter can be used to combine the wavefunctions of two separate photons. 
When these photons have the same properties of frequency and polarization, the
beamsplitter, used in a combining mode, will cause quantum interference effects
to appear between the two photons.

These two effects, separation and combination, are among the key elements
used to implement important applications of quantum photonics: quantum
computation and quantum communication.

2.1.1 Photon Properties 

A photon is a quantum of excitation of the electro-magnetic field. The 
characteristics of photons are: 

Mass = 0

Energy is proportional to frequency: $E = hf = \hbar\omega$

Speed: $c = 2.998 \times 10^{8}$ m s$^{-1}$ in vacuum

Wavelength: $\lambda = \frac{c}{nf}$, where n is the index of refraction

Spin (intrinsic angular momentum) = +1 (in units of $\hbar$) for left-hand
circularly polarized light and -1 for right-hand circularly polarized light

There are two additional fundamental properties of photons. 

• Any number of photons can occupy the same quantum state. This means 
that two or more photons can be indistinguishable. Quanta with this 
property are called bosons, and they obey Bose-Einstein statistics. 

• Photons are not conserved. Photons can be created (emission) or annihilated 
(absorption). Photon emission or absorption assures energy conservation in 
photonic materials and devices. 

2.1.2 Brief History of Photons 

Time zero: The Big Bang 

There is today significant experimental evidence for the reality of the Big
Bang. According to this model, nearly 14 billion years ago, quantum
fluctuations of the vacuum released a huge amount of energy over a space whose
diameter was hundreds of millions of light years in less than $10^{-32}$ s (Lineweaver
and Davis 2005). Following this explosion and subsequent expansion of
space-time, the universe was composed of a soup of electrons, positrons, protons,
neutrinos, and other fundamental particles, so dense that any photons emitted 
were reabsorbed. Some time afterward, when the universe had cooled 
sufficiently, protons and electrons could condense to form the first stable 
hydrogen atoms. Photons with energy insufficient to ionize these atoms could 
then propagate in space. This radiation is the oldest known photonic signal that 
can be measured directly. Measurement of this background radiation is an area of 
intense research today. Some important projects are the Cosmic Background
Explorer (COBE) and the Wilkinson Microwave Anisotropy Probe
(WMAP). Results from these projects have produced a compact timeline of the
universe shown in Fig. 2.1.

Fig. 2.1 Credit: WMAP (Wilkinson Microwave Anisotropy Probe). Photons, unlike electrons, were evident
from the beginning (http://map.gsfc.nasa.gov/media/060915/index.html public domain)


1665: Francesco-Maria Grimaldi carries out the first experiments on 
diffraction of light, inventing the word itself and publishes his results in his 
book Physics and Mathematics of Light, Colors and Rainbows, a 
significant work published two years after his death. 

1675: Newton studies Grimaldi’s work which serves as an inspiration for 
his study of light. Newton develops a corpuscular theory of light, but does 
not publish it, being aware of intense competition coming from the Paris 
Observatory. 

1676: Ole Roemer, working at the Paris Observatory, demonstrates that
light has a finite speed by measuring the variations in periodicity of the
eclipse of Jupiter’s moons. This periodicity oscillates as the earth orbits the
sun, changing the distance between the earth and Jupiter. Roemer’s first
measurement of this speed was $2.2 \times 10^{8}$ m s$^{-1}$, about 26% less than
today’s established value.

1678: Christiaan Huygens, working at the Paris Observatory, presents a wave
theory of light, based on experimental observations. However, aware of the
contradictory Newtonian theory, he delays publication until 1690. His book
becomes a scientific classic.

1704: Newton, seeking to have the last word, publishes his corpuscular
theory, hoping to supplant Huygens’s wave theory, and waiting until both
Huygens and Robert Hooke, the major scientific advocates for the wave
theory, were dead. For most of the eighteenth century, Newton’s
reputation ensures that his theory prevails.



1805: Thomas Young studies diffraction of light by a slit, experimental 
proof of the wave behavior of light. 

1865: James Clerk Maxwell introduces Maxwell’s equations, which give a
complete description of the propagation of light, uniting electricity and
magnetism as aspects of the same phenomenon.

1884: Oliver Heaviside studies Maxwell’s equations and starting from the 
20 equations and 20 unknowns proposed by Maxwell, reduces them to the 
four equations we use today. 

1901: Max Planck studies light radiated from objects as a function of their 
temperature. He concludes that light waves behave like particles with 
energy proportional to frequency. 

1905: Einstein studies the photoelectric effect, and argues that
photoionization of an electron from a solid is made by a single photon having
enough energy (proportional to its frequency) to free the electron from its 
bonded state. 









1927: Paul Dirac invents quantum electrodynamics, introducing Maxwell’s
field equations into quantum mechanical calculations.

1982: Quantum entanglement of photon pairs, a theoretical conjecture
dating from 1935, is tested by experiments devised by Alain Aspect that
demonstrate the non-locality of quantum mechanical particles like photons.

2000-2014: After many decades of near neglect, Nobel Prize committees 
recognize the importance of photons and photonics eight times: 

2000— The semiconductor double heterostructure laser. 

2001— Observation of Bose-Einstein condensation using laser cooling. 

2005—High precision measurement of time using laser frequency 
comb techniques. 

2009—Invention of low-loss optical fibers. 

2009—CCD photon imaging. 

2012—Quantum non-destructive measurements of single photons and 
atoms. 

2014—Invention of high-brightness solid-state blue LEDs and lasers. 
2014—Super resolution optical microscopy. 


2.2 Bose-Einstein Statistics

Following the work of Planck and Einstein, an electromagnetic wave is
composed of photons. The energy of the wave is given by the energy of each
photon, $\hbar\omega$, multiplied by the number of photons having that frequency, summed
over all the frequency states:

$$E(\omega) = N(\omega)\,\hbar\omega$$

To determine the statistical distribution of photon states as a function of 
temperature, we use the same approach as we used in Sect. 1.4 (1.7 and 
following) to determine the Fermi distribution function. 

Each state has an occupation probability P given by:

$$P = e^{(\Omega + \mu N - E)/kT} \qquad (2.1)$$

$N$ number of particles in the state in question

$\mu$ chemical potential

$E$ energy of the state

$k$ Boltzmann’s constant

$\Omega$ is the grand potential:

(2.2)


where $\varepsilon$ is the energy of a particle (in this case the photon) making up that
state.

Photons and electrons belong to entirely distinct classes of particles. Unlike
the electron, any number of photons can occupy the same energy state:

where $\varepsilon = \hbar\omega$ is the energy of a photon.
As a result,

The occupation density n as a function of energy is given by


(2.4) 



(2.5) 
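
For orientation, the standard grand-canonical steps that lead from (2.1) to the occupation density can be sketched as follows. This is a textbook-style sketch written under the assumption that a state holding $N$ photons has energy $E = N\varepsilon$ and that $\mu < \varepsilon$; the layout and intermediate steps are illustrative and are not taken from the numbered equations of this section.

```latex
% Normalization of (2.1) over all occupation numbers N = 0, 1, 2, ...
\sum_{N=0}^{\infty} P_N
  = e^{\Omega/kT} \sum_{N=0}^{\infty} e^{N(\mu-\varepsilon)/kT} = 1

% Geometric series (converges for \mu < \varepsilon)
\Omega = -kT \ln \sum_{N=0}^{\infty} e^{N(\mu-\varepsilon)/kT}
       = kT \ln\!\left(1 - e^{(\mu-\varepsilon)/kT}\right)

% Mean occupation number
n = -\frac{\partial \Omega}{\partial \mu}
  = \frac{1}{e^{(\varepsilon-\mu)/kT} - 1}
```

Setting $\mu = 0$ in the last line gives the form appropriate to photons.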


A second fundamental difference between electrons and photons is particle 
conservation. In the low energy environment of photonics (E < 10 keV), mass is 
neither created nor destroyed, and electron number is conserved. Photons, on the 
other hand, can be created or destroyed, and there is no conservation requirement 
on the number of particles. 

The chemical potential, $\mu$, measures the equilibrium condition of a system
of particles. At equilibrium, $\mu$ is the same everywhere. Otherwise, the system is
considered to be in a non-equilibrium state. Consider first the case of electrons.
If two regions with differing chemical potential, or Fermi level, are brought
together to form a system, then electrons will move from one region to the other
until the chemical potentials are the same and equilibrium is restored. The
displacement of electrons creates an electric field and thus an electric potential
difference between the two regions. This is why a pn-junction has a built-in
potential.

In the case of photons, where particle number is not conserved, if the 
chemical potential somewhere was higher than zero, photons would 
spontaneously disappear from that area until the chemical potential went back to 
zero; likewise if the chemical potential somewhere was less than zero, photons 
would spontaneously appear until the chemical potential went back to zero. 
When the number of particles is not conserved, as in the case of photons, we can 
set the chemical potential to zero everywhere. Making this substitution in (2.5) 
results in the Bose-Einstein distribution function: 

$$n(\omega) = \frac{1}{e^{\hbar\omega/kT} - 1} \qquad (2.6)$$

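The spectral radiance of an ideal black body is given by Planck’s law:

$$R(\omega, T) = \hbar\omega\,\frac{\omega^2}{2\pi^2 c^2}\,\frac{1}{e^{\hbar\omega/kT} - 1} \quad \text{in units of W Hz}^{-1}\,\text{sr}^{-1}\,\text{m}^{-2} \qquad (2.7)$$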
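
As a numerical illustration of the Bose-Einstein distribution (2.6) and of Planck’s law, the following minimal Python sketch evaluates the photon occupation number and the black-body spectral radiance per unit frequency. The function names, the use of the ordinary frequency $f$ (with $\hbar\omega = hf$), and the sample temperature are assumptions made for illustration only.

```python
import math

H = 6.62607015e-34   # Planck constant, J s
K_B = 1.380649e-23   # Boltzmann constant, J/K
C = 299_792_458.0    # speed of light in vacuum, m/s


def bose_einstein(f_hz, temp_k):
    """Mean photon number in a mode of frequency f at temperature T (chemical potential = 0)."""
    return 1.0 / math.expm1(H * f_hz / (K_B * temp_k))


def planck_radiance(f_hz, temp_k):
    """Black-body spectral radiance per unit frequency, in W Hz^-1 sr^-1 m^-2."""
    return (2.0 * H * f_hz**3 / C**2) * bose_einstein(f_hz, temp_k)


if __name__ == "__main__":
    temperature = 5800.0  # assumed temperature in kelvin, roughly that of the solar surface
    for f in (1e13, 1e14, 5e14, 1e15):
        n = bose_einstein(f, temperature)
        b = planck_radiance(f, temperature)
        print(f"f = {f:.1e} Hz   n = {n:.3e}   radiance = {b:.3e}")
```

The occupation number falls off rapidly once $hf$ exceeds $kT$, which is why the radiance has a maximum at a frequency that scales with temperature.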


2.3 Propagation of Photons 

The propagation of photons as waves is described by Maxwell’s equations which 
model light as a fluid. The validity of these equations has been proved over a 
truly astonishing range of conditions. When the theory of special relativity was 
proposed by Einstein in 1905, it was shown that Maxwell’s equations can be 
used without any modification to account for relativistic behavior. The scientific 
research of a number of scientists in the eighteenth and nineteenth centuries, 
synthesized by Maxwell as coupled electric and magnetic fields, and 
transformed by Heaviside into usable mathematical format, must rank among the 
greatest accomplishments of the human spirit. 

2.3.1 Maxwell’s Equations 

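E and H are the electric and magnetic vector fields
D is the electric displacement vector field
B is the magnetic induction vector field
$\rho$ is the free charge density
$\varepsilon$ is the dielectric constant
$\mu$ is the magnetic permeability
J is the current density

We will assume some simplifying conditions:
If the materials are macroscopic and isotropic, then $\mathbf{D}(\mathbf{r}) = \varepsilon(\mathbf{r})\,\mathbf{E}(\mathbf{r})$
In dielectric materials, $\mu = 1$, so $\mathbf{B} = \mathbf{H}$
In lossless materials, $\mathbf{J} = 0$ and $\rho = 0$

$$\nabla \cdot \mathbf{B} = 0$$

$$\nabla \cdot \mathbf{D} = 0$$

$$\nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t} = -\mu\,\frac{\partial \mathbf{H}}{\partial t} \qquad (2.8)$$

$$\nabla \times \mathbf{H} - \frac{\partial \mathbf{D}}{\partial t} = \mathbf{J} = 0$$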

Any vector field (strain is a good example) can be expressed in terms of two 
concepts 

Expansion, that is hydrostatic strain 
Rotation, that is shear strain 

The divergence of a field measures the linear expansion of a vector field at a
given point. This is a scalar number and is expressed as the dot product of the
gradient and the field vector: $\nabla \cdot \mathbf{E}(\mathbf{r})$.

The rotation is given by the cross-product of the gradient vector and the field
vector. The rotation of the field is a vector, giving both the magnitude and the
axis of the rotation.

If $\nabla \cdot \mathbf{B} = 0$ everywhere, that is, if there is no expansion or contraction, then
only rotational components of the field remain, and so B can be expressed as a
curl of a vector A.

$$\mathbf{B} = \nabla \times \mathbf{A}$$

Maxwell’s equations are quite general, and as a result, solutions differ 
depending on the geometry and properties of the medium in which light is 
propagating. 

In quantum photonics we often seek to find a relationship between quantum 
fields at the input and the output of a ‘device’. Many quantum photonic devices 
are produced as 2-dimensional planar structures, as a direct result of epitaxial 
crystal growth. The spacing between layers can be reduced in a quite 
controllable manner to quantum size dimensions; that is, for spacing dimensions 
similar to the wavelength of the particle-wave. Quantum effects can be observed 
in the 3rd dimension, perpendicular to the plane of crystal growth. It is
convenient to use a transfer matrix formalism that relates incident and reflected
travelling field amplitudes.



Example 2.1 

Propagation of light through a stack of materials having differing indices of 
refraction: an interference filter. 

In this example, diagrammed in Fig. 2.2, a photon enters from the left,
propagating in a dielectric medium Region 1 with $\varepsilon = \varepsilon_1\varepsilon_0$ and $\mu = \mu_0$. It then
traverses Region 2 with $\varepsilon = \varepsilon_2\varepsilon_0$ having thickness a, and exits in Region 3
having the same parameters as Region 1. At each interface there is a finite
probability for reflection and transmission. The objective of the example is to
calculate the transmission coefficient, $T$, in Region 3 as a function of
photon frequency.

Fig. 2.2 Structure for example 2.1, showing light propagation in the z-direction through a structure with
various dielectric constants (Region 1, Region 2, Region 3)


Solution: 

The electromagnetic field is transverse to the propagation direction. Without
loss of generality we can assume that $E_y = 0$. That is, the electric field is
polarized in the x-direction for a wave propagating along the z-axis.

Maxwell’s equation for wave propagation is:

$$\frac{\partial^2}{\partial z^2}E(z,t) = \varepsilon\mu\,\frac{\partial^2}{\partial t^2}E(z,t) \qquad (2.9)$$

A general expression for the solution to this equation is: $E(z,t) = E_0\,e^{i(kz - \omega t)}$

In Region 1, we have:

$$E(z,t) = A\,e^{i(k_1 z - \omega t)} + B\,e^{-i(k_1 z + \omega t)} \qquad (2.10)$$

and similarly for Regions 2 and 3, with amplitudes C and D in Region 2 and
F and G in Region 3.

In this particular example, note that G = 0, because there is no wave coming
from the right in Region 3. The wave vector $k = \omega\sqrt{\varepsilon\mu}$ will differ in each region
according to the values of $\mu$ and $\varepsilon$, while $\omega$ remains constant throughout the
structure.

Boundary conditions , specific to the example are necessary to proceed 
further. These apply to the interfaces between adjacent regions: 

(i) continuity of the electric field amplitude across the interface. This means
that photons are neither destroyed nor created as the wave crosses the
interface.


(ii) continuity of the first derivative of the electric field across the interface. 
This means that the momentum of the photons is conserved as the wave 
crosses the interface. 

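For the interface at z = 0,

(i) $A + B = C + D$

(ii) $k_1 A - k_1 B = k_2 C - k_2 D$

Solving for A and B in terms of C and D:

$$A = \frac{1}{2}\left(1 + \frac{k_2}{k_1}\right)C + \frac{1}{2}\left(1 - \frac{k_2}{k_1}\right)D$$

and

$$B = \frac{1}{2}\left(1 - \frac{k_2}{k_1}\right)C + \frac{1}{2}\left(1 + \frac{k_2}{k_1}\right)D \qquad (2.11)$$

This result can be written conveniently in matrix form

$$\begin{pmatrix} A \\ B \end{pmatrix} = \frac{1}{2}\begin{pmatrix} 1 + \frac{k_2}{k_1} & 1 - \frac{k_2}{k_1} \\ 1 - \frac{k_2}{k_1} & 1 + \frac{k_2}{k_1} \end{pmatrix}\begin{pmatrix} C \\ D \end{pmatrix} = \mathbf{M}\begin{pmatrix} C \\ D \end{pmatrix} \qquad (2.12)$$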


The matrix $\mathbf{M}$ is called a transfer matrix, and the procedure described
below is called the transfer-matrix method. It can be adapted to calculate the
optical and electronic properties of planar structures, both periodic and
aperiodic, that are encountered in quantum photonic materials.

For the interface at z = a,

(i) $C e^{ik_2 a} + D e^{-ik_2 a} = F e^{ik_1 a} + G e^{-ik_1 a}$

(ii) $k_2 C e^{ik_2 a} - k_2 D e^{-ik_2 a} = k_1 F e^{ik_1 a} - k_1 G e^{-ik_1 a}$

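Solving for C and D in terms of F and G:

$$\begin{pmatrix} C \\ D \end{pmatrix} = \frac{1}{2}\begin{pmatrix} \left(1 + \frac{k_1}{k_2}\right)e^{i(k_1 - k_2)a} & \left(1 - \frac{k_1}{k_2}\right)e^{-i(k_1 + k_2)a} \\ \left(1 - \frac{k_1}{k_2}\right)e^{i(k_1 + k_2)a} & \left(1 + \frac{k_1}{k_2}\right)e^{-i(k_1 - k_2)a} \end{pmatrix}\begin{pmatrix} F \\ G \end{pmatrix} = \mathbf{N}\begin{pmatrix} F \\ G \end{pmatrix}$$

These matrices can be combined:

$$\begin{pmatrix} A \\ B \end{pmatrix} = \mathbf{M}\begin{pmatrix} C \\ D \end{pmatrix} = \begin{pmatrix} M_{11} & M_{12} \\ M_{21} & M_{22} \end{pmatrix}\begin{pmatrix} N_{11} & N_{12} \\ N_{21} & N_{22} \end{pmatrix}\begin{pmatrix} F \\ G \end{pmatrix} \qquad (2.13)$$

This allows us to determine directly:

$A = \left(M_{11}N_{11} + M_{12}N_{21}\right)F$, where we have set G = 0.

$$\frac{1}{T} = \left(\frac{A}{F}\right)\left(\frac{A}{F}\right)^{*} = \frac{1}{4}\left[4\cos^2(k_2 a) + \left(\frac{k_1}{k_2} + \frac{k_2}{k_1}\right)^2 \sin^2(k_2 a)\right] \qquad (2.14)$$

The transmission coefficient T is expressed:

$$T = \frac{1}{\cos^2(k_2 a) + \frac{1}{4}\left(\frac{k_1}{k_2} + \frac{k_2}{k_1}\right)^2 \sin^2(k_2 a)} \qquad (2.15)$$

Note that when $k_2 a = n\pi$ where n is an integer, the transmission is unity,
otherwise T < 1.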

As an example, we consider visible light passing through a thin film of SiO$_2$
having an index of refraction of 1.5. We will assume the thickness of the film to
be 2 μm. Equation (2.15) is evaluated as a function of photon frequency, because
this remains constant in all regions, whereas the wavevector in Region 1 is
different from that in Region 2.

The graph of the transmission coefficient versus frequency for this case is
shown in Fig. 2.3.

Fig. 2.3 Transmission of light through a thin film of SiO$_2$ at normal incidence


The expected features are present: the transmission is periodic, although the 
minimum transmission remains about 85%. The periodicity depends directly on 
the film thickness, and it follows that transmission of photons away from the 
normal axis will increase the effective thickness of the film and therefore allow 
100% transmission at a different wavelength of incident light, creating a rainbow 
effect. Similar effects are observed if the film is not uniform in thickness across 
its plane, or if the film surface is not planar. 
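
The transfer-matrix calculation of Example 2.1 can be sketched in a few lines of Python. The sketch below builds the two interface matrices from the boundary conditions at z = 0 and z = a, multiplies them, and compares the resulting transmission with the closed-form single-slab expression of (2.15). The function names and the sampled frequencies are illustrative assumptions; the film parameters (index 1.5, thickness 2 μm, air on both sides) follow the example in the text.

```python
import cmath
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def matrix_m(k1, k2):
    """Interface at z = 0: (A, B) = M (C, D)."""
    r = k2 / k1
    return [[0.5 * (1 + r), 0.5 * (1 - r)],
            [0.5 * (1 - r), 0.5 * (1 + r)]]


def matrix_n(k1, k2, a):
    """Interface at z = a: (C, D) = N (F, G)."""
    r = k1 / k2
    return [[0.5 * (1 + r) * cmath.exp(1j * (k1 - k2) * a),
             0.5 * (1 - r) * cmath.exp(-1j * (k1 + k2) * a)],
            [0.5 * (1 - r) * cmath.exp(1j * (k1 + k2) * a),
             0.5 * (1 + r) * cmath.exp(-1j * (k1 - k2) * a)]]


def transmission_matrix(f_hz, n1, n2, a):
    """T = |F/A|^2 from the (1,1) element of the product M N, with G = 0."""
    k1 = 2 * math.pi * n1 * f_hz / C
    k2 = 2 * math.pi * n2 * f_hz / C
    m, n = matrix_m(k1, k2), matrix_n(k1, k2, a)
    a_over_f = m[0][0] * n[0][0] + m[0][1] * n[1][0]
    return 1.0 / abs(a_over_f) ** 2


def transmission_closed_form(f_hz, n1, n2, a):
    """Closed-form transmission coefficient of a single slab."""
    k1 = 2 * math.pi * n1 * f_hz / C
    k2 = 2 * math.pi * n2 * f_hz / C
    s = k1 / k2 + k2 / k1
    return 1.0 / (math.cos(k2 * a) ** 2 + 0.25 * s**2 * math.sin(k2 * a) ** 2)


if __name__ == "__main__":
    n_air, n_film, thickness = 1.0, 1.5, 2e-6
    for f in (4.0e14, 4.5e14, 5.0e14, 5.5e14, 6.0e14, 6.5e14, 7.0e14, 7.5e14):
        t_mat = transmission_matrix(f, n_air, n_film, thickness)
        t_cf = transmission_closed_form(f, n_air, n_film, thickness)
        print(f"f = {f:.2e} Hz   T(matrix) = {t_mat:.4f}   T(closed form) = {t_cf:.4f}")
```

The matrix route generalizes directly to a stack of several layers by multiplying one pair of interface matrices per layer, whereas the closed form applies only to a single slab.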

2.3.2 Reflection and Transmission: Fresnel’s Equations 

When a lightwave leaves one medium and enters another, a part of the beam is
reflected, and a part is transmitted. In the absence of scattering or absorption at
the interface, the sum of the intensities of the transmitted and reflected
components is equal to the intensity of the incident beam ($T^2 + R^2 = 1$, where T
and R are respectively the transmission and reflection coefficients). The details of
how the beam behaves will depend on the angle of incidence and polarization of
the incident beam as well as the optical properties of the media on each side of
the interface.

Maxwell’s equations can be directly solved to obtain the amplitude and 
phase of the electric field relative to that of the incident beam resulting in 
Fresnel’s equations. The reader is referred to any introductory text on 
electromagnetic behavior (for example, Hecht 1987) for the derivation. 

The case of an electromagnetic wave incident on a boundary is diagrammed 
in Fig. 2.4. We assume that the materials on both sides of the boundary are 
dielectrics. The electric field vector is chosen to be perpendicular to the plane of 
incidence so that it lies parallel to the plane of the interface. Solving Maxwell’s 
equations gives: 

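$$\left(\frac{E_r}{E_i}\right)_{\perp} = \frac{n_1\cos\theta_1 - n_2\cos\theta_2}{n_1\cos\theta_1 + n_2\cos\theta_2} = \frac{\left(\frac{n_1}{n_2}\right)\cos\theta_1 - \cos\theta_2}{\left(\frac{n_1}{n_2}\right)\cos\theta_1 + \cos\theta_2} \qquad (2.16)$$

and

$$\left(\frac{E_t}{E_i}\right)_{\perp} = \frac{2n_1\cos\theta_1}{n_1\cos\theta_1 + n_2\cos\theta_2} \qquad (2.17)$$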




Fig. 2.4 When an electromagnetic wave is incident on a boundary between two materials, part of the beam 
is reflected and part is transmitted. Maxwell’s equations require that the angle of incidence equal the angle 
of reflection. The plane of incidence is defined by the propagation vectors of the incident and reflected 
waves 


These are two of Fresnel’s equations . The remaining two describe the 
situation where the electric field is parallel to the plane of incidence. 

Equations (2.16) and (2.17) can be used to model the behavior of an optical
beam splitter consisting of a simple glass plate. There are 2 cases to consider.

Case 1: $n_1 < n_2$

We could suppose that $n_1 = 1$ (air), and that $n_2 = 1.5$ (representing glass),
for example. The incident beam strikes the glass surface from the air side. It is
clear that $(E_t/E_i)_{\perp}$ is always real and positive. Thus the transmitted wave is in phase
with the incident wave. On the other hand, if $n_1 < n_2$ then $\theta_1 > \theta_2$ and
$\cos\theta_1 < \cos\theta_2$. Thus $(E_r/E_i)_{\perp}$ is negative, indicating a phase change or polarization
change of $\pi$ radians ($e^{i\pi} = -1$) of the reflected wave relative to the incident beam.

Case 2: $n_1 > n_2$

This case corresponds to the wave incident on the interface from the glass
side. The transmitted wave is in phase with the incident wave as before. For the
reflected wave, we note that $n_1 > n_2$, and that $\cos\theta_1 > \cos\theta_2$. $(E_r/E_i)_{\perp}$ is positive for
all angles of incidence. Therefore the reflected wave is in phase with the incident
wave.
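
The sign behavior described in the two cases can be checked numerically. The following Python sketch evaluates (2.16) and (2.17) for an electric field perpendicular to the plane of incidence. It assumes Snell’s law, $n_1\sin\theta_1 = n_2\sin\theta_2$, to obtain the refraction angle, and it is restricted to angles below the critical angle; the function name is a hypothetical choice for illustration.

```python
import math


def fresnel_perpendicular(n1, n2, theta1):
    """Return (E_r/E_i, E_t/E_i) for E perpendicular to the plane of incidence.

    theta1 is the angle of incidence in radians. Only real refraction angles
    are handled (no total internal reflection).
    """
    sin_theta2 = n1 * math.sin(theta1) / n2  # Snell's law (assumed)
    if abs(sin_theta2) > 1.0:
        raise ValueError("total internal reflection: no real refraction angle")
    cos1 = math.cos(theta1)
    cos2 = math.sqrt(1.0 - sin_theta2**2)
    denom = n1 * cos1 + n2 * cos2
    r = (n1 * cos1 - n2 * cos2) / denom
    t = 2.0 * n1 * cos1 / denom
    return r, t


if __name__ == "__main__":
    for label, n1, n2 in (("air to glass", 1.0, 1.5), ("glass to air", 1.5, 1.0)):
        for degrees in (0, 15, 30):
            r, t = fresnel_perpendicular(n1, n2, math.radians(degrees))
            print(f"{label}, {degrees:2d} deg: r = {r:+.4f}, t = {t:+.4f}")
```

For air to glass the reflected amplitude is negative at every angle, and for glass to air it is positive, while the transmitted amplitude is positive in both cases.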

The example in the previous section could serve as the basis for the analysis
of a simple beam-splitter. We can make a description of such a beam splitter as a
device with two inputs $E_1$ and $E_2$. The outputs are $E_3$ and $E_4$ as shown in
Fig. 2.5.

Fig. 2.5 A model of an optical beamsplitter as a device with two input ports ($E_1$ and $E_2$), and two output
ports ($E_3$ and $E_4$)


We can write the outputs as a function of the inputs. The electric field
amplitudes at the 2 inputs are $E_1$ and $E_2$, and the output field amplitudes are $E_3$
and $E_4$.

$$E_3 = \left(RE_1 + TE_2\right) \qquad (2.18)$$

$$E_4 = \left(TE_1 + e^{i\pi}RE_2\right) \qquad (2.19)$$

where the polarization reversal specified by Fresnel’s equations is required in
(2.19).

$$(E_3)^2 = (RE_1)^2 + (TE_2)^2 + 2TRE_1E_2$$

$$(E_4)^2 = (TE_1)^2 + (RE_2)^2 - 2TRE_1E_2 \qquad (2.20)$$

and

$$E_3^2 + E_4^2 = E_1^2\left(T^2 + R^2\right) + E_2^2\left(T^2 + R^2\right) = E_1^2 + E_2^2, \qquad (2.21)$$

conserving energy as required.
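
The energy balance of (2.18) to (2.21) can be verified with a short Python sketch. It treats the field amplitudes as real numbers and writes the phase reversal in (2.19) as a minus sign; the function name and the sample amplitudes are illustrative assumptions.

```python
import math


def beam_splitter(e1, e2, r, t):
    """Return the output amplitudes (E3, E4) for input amplitudes (E1, E2)."""
    e3 = r * e1 + t * e2
    e4 = t * e1 - r * e2  # minus sign: phase reversal of pi on one reflection
    return e3, e4


if __name__ == "__main__":
    r = t = 1.0 / math.sqrt(2.0)  # assumed 50/50 splitter, so r**2 + t**2 = 1
    for e1, e2 in ((1.0, 0.0), (1.0, 1.0), (0.3, -0.8)):
        e3, e4 = beam_splitter(e1, e2, r, t)
        print(f"inputs ({e1}, {e2}): output power = {e3**2 + e4**2:.6f}, "
              f"input power = {e1**2 + e2**2:.6f}")
```

Without the sign reversal the cross terms would add instead of cancel, and the output power would not equal the input power.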


2.3.3 Single Photon Behavior 

In Fig. 2.6a, we show a beam splitter at work. A beam of light strikes a beam
splitter, and the intensity of the beam is divided according to the properties of the
beam splitter and the angle of incidence.

Fig. 2.6 A schematic view of photons passing through a beam splitter. (a) In the case of a beam of photons,
Maxwell’s equations give a precise account of how the intensity of the beam is divided. (b) Although a single
photon cannot be divided in two, the wavefunction is present in both paths. The photon, however, can be
detected only once. Therefore it must be detected either at T, showing that the photon propagated straight
through the beam splitter, or at R showing that the photon was reflected. Experiments by Grangier et al.
(1986) have demonstrated that the choice between transmission and reflection does not occur when the
photon passes through the beam splitter, but rather when the photon is detected


In place of the light beam, suppose that we could substitute a source 
containing only a single photon. This case is shown in Fig. 2.6b. A photon is an 
elementary particle and cannot be divided. Hence, it must either be entirely 
detected by the detector in the transmission path or entirely by the detector in the 
reflection path. However, the Schrödinger equation shows that the probability
intensity is non-zero for both paths. According to the Copenhagen interpretation 
of quantum mechanics, detection of the photon shows which path the photon has 
taken, and the wavefunction in the path not taken collapses to zero. 

How does the photon decide which route to take? No existing theory can 
explain this fundamental mystery of quantum photonics. The photon 
wavefunction is non-zero along both paths until detection occurs. We could say 
the photon “decides” which path it has taken only after it is detected. We 
continue this discussion in Sect. 2.4.3. 


2.4 Wave Behavior of Photons 
2.4.1 Computational Examples 

Example 2.3 








Propagation of light through a plasma layer 

Next, using (2.15) we will analyse the propagation of light through a plasma 
of electrons and protons. With equal numbers of electrons and protons, the 
plasma is electrically neutral. For simplicity, we will consider the problem in one 
dimension, as shown in Fig. 2.7. 

A free electron in such a plasma is repelled by other electrons. The electric 
field experienced by an electron in such a plasma can be calculated from 
Maxwell’s first equation: 



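$$\nabla \cdot \mathbf{E} = \frac{\rho}{\varepsilon} \quad\Rightarrow\quad E = \frac{qN\chi}{\varepsilon} \qquad (2.22)$$

where q is the charge of one electron, and N is the electron density.
The force on the electron is expressed

$$m\,\frac{d^2\chi}{dt^2} = -qE = -\frac{q^2N}{\varepsilon}\,\chi \qquad (2.23)$$

where $\chi(t)$ gives the position of the electron. Equation (2.23) is in the form
of an oscillator

$$\frac{d^2\chi}{dt^2} + \omega_p^2\,\chi = 0$$

where

$$\omega_p = \sqrt{\frac{q^2N}{m\varepsilon}} \qquad (2.24)$$

is defined as the plasma frequency.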


In this example, a photon wave travelling along the z-axis encounters a region of 
plasma. 

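Maxwell’s equations are:

$$\nabla \times \mathbf{E} + \mu\,\frac{\partial \mathbf{H}}{\partial t} = 0$$

$$\nabla \times \mathbf{H} - \varepsilon\,\frac{\partial \mathbf{E}}{\partial t} - \sigma\mathbf{E} = 0$$

where $\sigma = \frac{q^2N}{i\omega m}$ is the conductivity of the plasma.

The fields can be expressed as waves,

$$E = E_0\,e^{i(\omega t - kz)}$$

Maxwell’s equation for the electric field is:

$$\frac{\partial^2 E}{\partial z^2} = \mu\varepsilon\,\frac{\partial^2 E}{\partial t^2} + \mu\sigma\,\frac{\partial E}{\partial t}$$

Substituting for E yields the dispersion relationship between $\omega$ and k:

$$k^2 = \mu\varepsilon\omega^2 - i\omega\mu\sigma$$

and

$$k^2 = \mu\varepsilon\omega^2\left(1 - \frac{i\sigma}{\varepsilon\omega}\right) = \mu\varepsilon\omega^2\left(1 - \frac{\omega_p^2}{\omega^2}\right) \qquad (2.25)$$

Taking $\mu = \mu_0$ and $\varepsilon = \varepsilon_0$,

$$k^2 = \frac{\omega^2 - \omega_p^2}{c^2} \qquad (2.26)$$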


When the wave frequency is less than the plasma frequency, (2.26) shows 
that the wave vector is imaginary (Fig. 2.8). Thus E-M wave propagation is 
strongly attenuated in this regime (Fig. 2.9). 
Fig. 2.7 Schematic diagram showing the index of refraction of a planar plasma layer embedded in a
dielectric region having an index of refraction = $n_1$


Assume that Regions 1 and 3 are free space:

(2.27)

The dependence of $k^2$ on frequency is shown in Fig. 2.8.

















Fig. 2.8 Dispersion relation showing the square of the wavevector as a function of frequency for a plasma. 
The wavevector becomes a real number for frequencies greater than the plasma frequency 

The dispersion relation is a non-local relationship between wavevector and 
frequency of wave-particles. Since momentum is directly proportional to 
wavevector, and energy is directly proportional to frequency, the dispersion 
relation can also be considered as the energy-momentum relationship for the 
wave-particle. Knowing the dispersion relation is often more important and 
informative than knowing the wavefunction. 

In the low frequency regime, $\omega < \omega_p$ and the wavevector in Region 2 is
imaginary. As a result the photon waves are strongly attenuated.

For incident waves having a frequency above the plasma frequency, the 
wavevector is real, and the conditions of the calculation resemble those we have 
just considered. 

Despite these rather different situations, the transmission coefficient is given 
by (2.15) for the entire frequency range both below and above the plasma 
frequency. 

In Fig. 2.9, we show the result of this calculation for a photon wave passing
through a plasma sheath having a plasma frequency of $5 \times 10^{14}$ Hz and a
thickness of 5 μm.

Fig. 2.9 Propagation of photons through a plasma sheath as a function of photon frequency. The vertical
dashed line indicates the plasma frequency of $5 \times 10^{14}$ Hz. The inset in the figure shows the same
calculation with the transmission coefficient plotted on a logarithmic scale

At first view, the transmission coefficient is near 0 for frequencies below the
plasma frequency and increases rapidly above it, showing oscillatory
behavior. However, there is more to this result. Some transmission of photons
through the barrier is permitted even when the wavevector is imaginary. The
inset of Fig. 2.9 shows that there is transmission of photons through the
barrier at all frequencies, although the attenuation becomes stronger as the
frequency diminishes. This phenomenon, the transmission of a photon through a
region where the photon should not be allowed, is called tunneling. Tunneling
is a fundamental feature of wave behavior. All particle-waves (photons,
electrons, even atoms) can display tunneling. Important quantum photonic
devices depend on tunneling to function properly.
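
The tunneling regime can be explored numerically. The sketch below is a minimal illustration, not the calculation behind Fig. 2.9: it assumes the standard transmission formula for a uniform slab of thickness a surrounded by vacuum, with $k_1 = 2\pi f/c$ outside and $k_2 = 2\pi\sqrt{f^2 - f_p^2}/c$ inside, which may differ in form from (2.15). The function name and its defaults are hypothetical; the defaults use the plasma frequency quoted above and assume a thickness of 5 µm. Taking the square root in complex arithmetic lets one expression cover both the propagating and the tunneling regime.

```python
import numpy as np

C = 2.998e8  # speed of light in vacuum (m/s)

def slab_transmission(freq_hz, plasma_freq_hz=5e14, thickness_m=5e-6):
    """Power transmission through a uniform plasma slab in vacuum (assumed model).

    Valid for freq_hz > 0 and freq_hz != plasma_freq_hz.
    """
    f = np.asarray(freq_hz, dtype=complex)
    k1 = 2 * np.pi * f / C                                  # wavevector outside the slab
    k2 = 2 * np.pi * np.sqrt(f**2 - plasma_freq_hz**2) / C  # imaginary below f_p
    mismatch = 0.5 * (k1 / k2 + k2 / k1)
    # Amplitude transmission of a single slab; |t|^2 is the power transmission.
    amplitude = 1.0 / (np.cos(k2 * thickness_m)
                       - 1j * mismatch * np.sin(k2 * thickness_m))
    return np.abs(amplitude) ** 2

if __name__ == "__main__":
    for f in (2e14, 4e14, 4.9e14, 6e14, 8e14):
        print(f"{f:.1e} Hz -> T = {float(slab_transmission(f)):.3e}")
```

Below the plasma frequency the returned value is extremely small but never exactly zero, which is the tunneling contribution visible only on a logarithmic scale.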

2.4.2 Young’s Double Slit Experiment 

In 1805, Thomas Young demonstrated and interpreted the passage of a photon 
beam through a pair of slits in an opaque screen. His experiment showed the 
photon beam created a diffraction pattern associated with wave behavior of light. 
This work was revolutionary, overturning 100 years of belief in the Newtonian 
corpuscular theory, and rehabilitating the studies of Huygens. 

In Fig. 2.10 we show a sketch of the experiment as devised by Young. A light
beam passes through a narrow opening and is incident on a pair of slits,
characterized by the slit width (d) and their separation (a). The passage through
the slits gives rise to diffraction effects and, under the right conditions, a
pattern of light and dark fringes can be captured on a screen behind the slits.

Fig. 2.10 Young’s two-slit experiment of 1805 demonstrated wave-like behavior of photons. Each slit
radiates light with circular wave fronts. The arrows show where the beams overlap in-phase, giving rise to
maxima in the intensity. In between, destructive interference reduces the intensity

Young sought to make the wave nature of light clear by demonstration. In 
order to optimise the diffraction effects, the light should be monochromatic and 
the phase front of the light should be the same at the entry of both slits. His 
critical innovation was the addition of a small opening in front of the two-slit 
passage in order to create a point-like source with spatial coherence. 

A simple geometrical analysis relates the wavelength of light $\lambda$, the
distance between the slits $a$, and the angle of divergence $\theta$ of the rays
of maximum intensity: $n\lambda = a \sin\theta$
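
As a numerical illustration, the following sketch evaluates the far-field (Fraunhofer) two-slit intensity, $I(\theta) \propto \cos^2(\pi a \sin\theta/\lambda)\,\mathrm{sinc}^2(\pi d \sin\theta/\lambda)$, where a is the slit separation and d the slit width of Fig. 2.10. The maxima of the $\cos^2$ factor fall at the angles where $n\lambda = a\sin\theta$. The wavelength and slit dimensions are hypothetical values chosen for illustration, and the function names are not taken from the text.

```python
import numpy as np

def two_slit_intensity(theta, wavelength, separation, width):
    """Normalized Fraunhofer intensity for two slits (equal to 1 at theta = 0)."""
    s = np.sin(theta)
    interference = np.cos(np.pi * separation * s / wavelength) ** 2
    # np.sinc(x) is sin(pi x)/(pi x), so the factor of pi is built in.
    envelope = np.sinc(width * s / wavelength) ** 2
    return interference * envelope

def fringe_angles(wavelength, separation, max_order):
    """Angles of the interference maxima, from n * wavelength = separation * sin(theta)."""
    n = np.arange(-max_order, max_order + 1)
    return np.arcsin(n * wavelength / separation)

if __name__ == "__main__":
    lam, a, d = 633e-9, 100e-6, 20e-6  # hypothetical: red laser, 0.1 mm separation
    for n, th in zip(range(-2, 3), fringe_angles(lam, a, 2)):
        print(n, np.degrees(th), two_slit_intensity(th, lam, a, d))
```

The single-slit envelope lowers the brightness of the higher-order fringes, so the slit width d controls how many fringes are visible while the separation a controls their spacing.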

Young’s experiment proves that photons have wave properties. However, it
does not eliminate the possibility that photons also have particle properties. A
century later these were shown by Planck and Einstein. The photon is now
understood as being both a wave and a particle. Two centuries later, Young’s
two-slit experiment remains a subject at the frontier of fundamental
understanding of quantum behavior. Experimental instrumentation has reached the
point where it is possible to detect individual photons. Sources can be chosen
so that the passage of individual photons through the two-slit interferometer
can be recorded at the image plane. Since only one photon is present in the
apparatus at any time, multiple photons cannot interfere with each other. Thus,
the interference pattern that builds up is the result of each photon interfering
with itself. An elegant experiment has been carried out by Dykstra and
colleagues at the University of Leiden, showing the detection of photons and the
build-up of the interference pattern as a function of time.¹ A video of this
experiment can be viewed at: https://www.youtube.com/watch?v=MbLzhlY9POQ.

Electrons, like photons, exhibit wave and particle behavior simultaneously.
Akira Tonomura and colleagues from Hitachi Labs in Japan have carried out
analogous experiments using electrons passing through a double slit etched into
a copper foil. Images taken at successive times from their 1989 paper² are
shown in Fig. 2.11.



Fig. 2.11 Results of a double-slit-experiment performed by A. Tonomura showing the build-up of an 
interference pattern of single electrons. Numbers of electrons are 11 (a), 200 (b), 6000 (c), 40000 (d), 
140000 (e) (reproduced from Tonomura et al. (1989) by permission of the American Physical Society) 


Since each electron passes individually through the arrangement of slits, 
these measurements show that the electron passes through both slits, and that the 
electron particle-wave interferes with itself. 

In Fig. 2.11a, each electron is detected only once at a well-defined point in
space, a direct result of the quantum nature of the electron. Over time, the
passage of more electrons contributes to the build-up of the pattern of light and
dark fringes, a manifestation of wave interference, based on the wavelength of
the electron and the separation between the slits.

This experiment demonstrates that the electron displays both wave and
particle behavior. It thus makes clear that the electron wavefunction interferes
with itself. The electron wavefunction that represents the quantum is present in
both slits, reflecting its delocalized nature. It is this delocalized nature that
produces the interference shown in Fig. 2.11. It therefore makes little sense to
ask through which slit the electron has passed.

In the classic Young’s experiment where the source intensity is higher, each 
photon traverses the two-slit structure independent of all the others. Because the 
photon is a quantum, it can be detected on the other side at only one point. Thus, 
the process of detection forces the quantum to become localized at that point, so 
that its entire energy is given up to the detection apparatus in one interaction. 

2.4.3 Single Photons 

The complete understanding of measurements of photons in quantum systems is
a work in progress. The double slit interference experiment reveals some
important questions about the meaning of measurements, and the limits of what
can be known and what must remain unknown in this simple experiment.

In a measurement, the photon (or an electron) can be measured at one 
location in space and time. The double-slit experiment shows that prior to 
measurement the photon can be delocalized, sampling two slits separated by a 
macroscopic distance. By observing the experiment in real time, we see that 
apparently identical photons wind up in different places as a result of traversing 
the same double-slit structure. We could think of the interference pattern as 
reflecting the probability distribution that describes where a photon quantum 
(electron) will be detected. This behavior might suggest that the impact of the
photon (electron) on the double-slit structure introduces a measurable
structured statistical uncertainty in the trajectory of the quantum.
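
The statistical picture in this paragraph can be mimicked numerically. The sketch below is a toy model only: it assumes an ideal $\cos^2$ fringe pattern as the probability distribution for the detection position on the screen, and the fringe period, screen half-width and function name are hypothetical. Each simulated quantum is detected at a single point; the fringes appear only in the histogram of many detections.

```python
import numpy as np

def sample_detections(n_quanta, fringe_period=1.0, half_width=3.0, seed=0):
    """Rejection-sample detection positions x with density ~ cos^2(pi x / period)."""
    rng = np.random.default_rng(seed)
    hits = []
    while len(hits) < n_quanta:
        # Propose positions uniformly on the screen ...
        x = rng.uniform(-half_width, half_width, size=n_quanta)
        # ... and keep each one with probability equal to the fringe intensity.
        keep = rng.uniform(0.0, 1.0, size=n_quanta) < np.cos(np.pi * x / fringe_period) ** 2
        hits.extend(x[keep].tolist())
    return np.array(hits[:n_quanta])

if __name__ == "__main__":
    for n in (10, 200, 6000, 40000):
        counts, _ = np.histogram(sample_detections(n), bins=24, range=(-3.0, 3.0))
        print(n, counts)
```

With about ten detections the histogram looks random; with thousands, the bins near integer positions fill up while those near half-integer positions stay almost empty.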

The Copenhagen interpretation is a framework of principles that address this 
kind of question about the nature of quantum behavior prior to measurement. 
When applied to the double slit experiment, this interpretation would say that it 
is not possible to know how the photon passes through the double-slit structure. 
When the photon arrives at the image screen, it is in a superposition of all 
possible final states at the image plane. The amplitude of its wavefunction 
replicates the interference pattern that is observed after the arrival of many 
photons. When the photon is detected by a sensor having a particular position
fixed in space, the superposition of these states must collapse into the state that
corresponds to the location of the detecting sensor. Thus, following the
Copenhagen interpretation, the photon wave function remains delocalized up to 
the moment of detection. 

In 1986, Grangier, Roger and Aspect published an account of pioneering
and elegant experiments that show the particle and wave behavior of a single
photon in an interferometer like the double-slit experiment. The presentation
below follows the treatment of this experiment by M. Beck³ of Whitman
College.

1. Passage of a photon through a beam splitter. 

This event corresponds to the illustration in Fig. 2.6 earlier in this chapter. 
The solution of Maxwell’s equations gives a transmission coefficient and a 
reflection coefficient. The situation is diagrammed in Fig. 2.12. 


Fig. 2.12 A single photon incident on an ideal beam splitter is either transmitted 100% to T or reflected
100% to R


A single photon that traverses the beam splitter will either be transmitted or
reflected. If the transmission detector T records a photon (= 1), then the
reflection detector must record nothing (= 0). This result demonstrates
anticorrelation. The calculation of the transmission and reflection coefficients
of Example 2.1 indicates the relative numbers of counts at the transmission and
reflection detectors if many experiments are conducted. In this description, we
are supposing that the detectors are 100% efficient and that there are
negligible losses along the trajectory of the photon.

We can quantify this result as follows:

Suppose that $P_T$ is the probability that a photon is detected at T, $P_R$ is
the probability that a photon is detected at R, and $P_{TR}$ is the probability
that a photon is detected at both T and R at the same time. We can express the
degree of correlation, the correlation coefficient $g^{(2)}$, by the ratio:

    $g^{(2)}(\tau = 0) = \frac{P_{TR}}{P_T P_R}$    (2.28)

where $\tau$ is the time delay between photon detection at T and R.
$g^{(2)}(0) = 0$ for a single photon, because $P_{TR} = 0$.

If we allow the time delay to be long enough, eventually $P_{TR} > 0$. For very
long times $P_{TR} \to P_T P_R$, and $g^{(2)}(\tau \to \infty) = 1$.

For a beam of classical lightwaves of intensity $I_I$ incident on the beam
splitter, we measure intensities:

    $I_T = T I_I$

    $I_R = R I_I$

and

    $T + R = 1$

In this case
$g^{(2)}(0) = \frac{\langle I_T I_R \rangle}{\langle I_T \rangle \langle I_R \rangle} = \frac{\langle I_I^2 \rangle}{\langle I_I \rangle^2}$,
and its value is determined by the statistics of the incoming beam.

The Cauchy-Schwarz inequality, however, specifies that
$\langle I_I^2 \rangle \geq \langle I_I \rangle^2$, so in the classical case:

    $g^{(2)}(0) \geq 1$    (2.29)

Thus, the single photon case can be distinguished from the classical case in a
measurement.
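
The classical bound can be checked numerically. The sketch below is a minimal illustration using invented intensity samples: it evaluates $\langle I_T I_R \rangle / (\langle I_T \rangle \langle I_R \rangle)$ for a beam of constant intensity and for a beam whose intensity follows an exponential distribution (a common model of thermal light, assumed here purely as an example). Because $\langle I^2 \rangle - \langle I \rangle^2$ is a variance, the ratio cannot fall below 1, whatever the splitting ratio.

```python
import numpy as np

def classical_g2(intensity, transmission=0.5):
    """g2(0) for a classical beam split into T and R arms by a lossless beam splitter."""
    i_in = np.asarray(intensity, dtype=float)
    i_t = transmission * i_in            # I_T = T * I_I
    i_r = (1.0 - transmission) * i_in    # I_R = R * I_I with T + R = 1
    return np.mean(i_t * i_r) / (np.mean(i_t) * np.mean(i_r))

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    steady = np.full(100_000, 2.0)                      # no intensity fluctuations
    thermal = rng.exponential(scale=2.0, size=100_000)  # assumed thermal-like statistics
    print("steady :", classical_g2(steady))
    print("thermal:", classical_g2(thermal))
```

The steady beam sits exactly at the lower bound, and any fluctuation of the input intensity pushes the value above it; a measured value well below 1 therefore has no classical-wave explanation.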

A note on losses and non-ideality:

In a real experiment, the single-photon detection efficiency is less than 100%.
In addition, the detector has a dark count rate. As a result, when a photon
passes through the beam splitter and is incident on the detectors, there are
four possible results:

(a) No photon is detected 

(b) A photon is detected at T, but not at R 

(c) A photon is detected at R, but not at T 

(d) A photon is detected at T and at R 

Result b or c is the behavior we would expect from the passage of a single
quantum through the experimental set-up. Result a is the result of the limited
quantum efficiency of single photon detectors. A typical value for this efficiency
is 60%. Result d occurs because the detection window is open long enough that
two photons are occasionally present. Both the dark count rate and the detector
quantum efficiency can be measured, and this information can be used to correct
the statistics of the raw data from many measurements. This short discussion
emphasizes that while the ideal single quantum experiment is straightforward to
describe, obtaining meaningful results from a real experiment requires careful
analysis to account for non-idealities, of which this is an important example.
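
How the four results depend on the detector parameters can be sketched with a simple probability model. The model is an assumption made for illustration, not the analysis used in the experiment: exactly one photon enters the beam splitter, each detector registers an arriving photon with probability equal to its quantum efficiency, and each detector independently produces a dark count with a fixed probability per detection window. Two-photon events, which the text names as the origin of result d, are not modeled, so coincidences arise here only from dark counts. All names and default values are hypothetical.

```python
def outcome_probabilities(efficiency=0.6, dark_prob=1e-4, transmission=0.5):
    """Probabilities of results (a)-(d) for one photon entering the beam splitter."""
    # Click probability of the detector that the photon actually reaches.
    p_hit = 1.0 - (1.0 - efficiency) * (1.0 - dark_prob)
    # Click probability of the other detector (dark count only).
    p_dark = dark_prob
    t, r = transmission, 1.0 - transmission
    return {
        "a_none": (1.0 - p_hit) * (1.0 - p_dark),
        "b_T_only": t * p_hit * (1.0 - p_dark) + r * p_dark * (1.0 - p_hit),
        "c_R_only": r * p_hit * (1.0 - p_dark) + t * p_dark * (1.0 - p_hit),
        "d_both": p_hit * p_dark,
    }

if __name__ == "__main__":
    for name, p in outcome_probabilities().items():
        print(f"{name:9s} {p:.6f}")
```

With ideal detectors (efficiency 1, no dark counts) only results b and c remain, in the ratio set by the beam splitter.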

Patrick Bronner of the Department of Physics at the Universität Erlangen has
prepared this experiment on-line
(http://www.didaktik.physik.uni-erlangen.de/quantumlab/english/index.html; see
“photon existence experiment”), and the reader can follow the set-up and the
taking and analysis of the data.

2. Generation of single photons 

Beta Barium Borate, or BBO, is a nonlinear optical crystal. BBO’s unique
combination of properties makes it an excellent choice for second and higher
order harmonic generation, frequency mixing, broadly tunable sources, and
parametric down-conversion applications. In the following experiment, BBO is
used for parametric frequency down-conversion of a single photon of frequency
$\omega$ into two photons of frequency $\omega/2$. Energy is of course
conserved. Parametric frequency down-conversion is treated in more detail in
this book in Chaps. 5 and 9.

Conservation of momentum (referred to as phase-matching for photon
beams) requires that the two lower frequency photons be emitted at an angle on
each side of the primary beam. This feature enables the easy separation of the
photons into two measurement circuits as shown in Figs. 2.13 and 2.14. When a
photon is detected by the gating detector G, then we know with certainty that a
single photon has been launched into the single photon experiment.
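
The two conservation conditions can be written compactly. The subscripts p, s and i (pump, signal and idler) are conventional labels assumed here; they are not notation taken from the text.

```latex
\begin{align}
  \hbar\omega_p     &= \hbar\omega_s + \hbar\omega_i     && \text{(energy)} \\
  \hbar\mathbf{k}_p &= \hbar\mathbf{k}_s + \hbar\mathbf{k}_i && \text{(momentum, i.e. phase matching)}
\end{align}
```

In the degenerate case considered here, $\omega_s = \omega_i = \omega_p/2$. The components of $\mathbf{k}_s$ and $\mathbf{k}_i$ perpendicular to the pump direction must then cancel, which is why the two photons emerge on opposite sides of the pump beam.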



Fig. 2.13 Non-linear materials, in this case BBO, can be used to generate 2 photons each having half the
frequency of the incoming beam. These photons are emitted at a well-defined angle relative to the
incoming beam (in this case about 3°), allowing for easy separation from each other and from the incoming
beam



Fig. 2.14 Detection of a single photon by the gating detector G confirms that a second single photon will 
enter the beam splitter and result in a signal at either R or T, but not both. The results of this experiment 
show this anticorrelation effect which is a non-classical quantum result. Data from this experiment can be 
analyzed in Exercise 2.8 

Experiment no. 1: Passage through a single beam splitter: measurement of
photon quantum behavior (see Fig. 2.14).

This experiment is quite close to Young’s double-slit experiment. There are
two paths for the photon: transmission or reflection. By putting a detector in
each arm we can determine with certainty along which path the photon travels. A
single photon can be detected only once, and it must generate a count either at T
or at R, but not both. Thus we expect to see anticorrelation of detection events,
confirming two results: a single photon in the interferometer and particle-like
behavior.

Results of this experiment reported by Thorn et al. (2004) give
$g^{(2)}(\tau = 0) = 0.0177 \pm 0.0026 \ll 1$. Quantum behavior is clearly
established. The result does not show perfect anticorrelation, because of less
than perfect detection efficiency, for example, and other non-idealities in the
experiment.
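
With a gating detector, the correlation coefficient is estimated from count totals. The sketch below assumes the three-detector estimator used by Thorn et al. (2004), $g^{(2)}(0) = N_{GTR} N_G / (N_{GT} N_{GR})$, where $N_G$ is the number of gate counts and $N_{GT}$, $N_{GR}$ and $N_{GTR}$ are the numbers of coincidences with the gate. The function name is hypothetical and the counts in the usage example are invented for illustration; they are not experimental data.

```python
def heralded_g2(n_gate, n_gate_t, n_gate_r, n_gate_t_r):
    """Estimate g2(0) from gate counts and gated coincidence counts."""
    if n_gate_t <= 0 or n_gate_r <= 0:
        raise ValueError("coincidence counts N_GT and N_GR must be positive")
    # Each probability is a count divided by N_G, so one factor of N_G survives.
    return n_gate_t_r * n_gate / (n_gate_t * n_gate_r)

if __name__ == "__main__":
    # Invented counts: 100,000 gate events, 4,000 coincidences in each arm,
    # and 3 triple coincidences.
    print(heralded_g2(100_000, 4_000, 4_000, 3))
```

If detections at T and R were statistically independent, $N_{GTR}$ would equal $N_{GT} N_{GR} / N_G$ and the estimator would return 1; a small number of triple coincidences drives it toward 0.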





Fig. 2.15 The presence of two beamsplitters in the optical path means that we can no longer have
knowledge about which path the photon follows to reach the detectors. Every photon that passes through the
interferometer will be detected either at T or at R, but not by both simultaneously, because there is only one
photon, and it can be detected only once

Experiment no. 2: Passage through 2 beam splitters: measurement of single 
photon interference and photon wave behavior. 

In this experiment we insert a second beamsplitter and two mirrors, creating
two parallel propagation paths (called a Mach-Zehnder interferometer) as
shown in Fig. 2.15.

When a photon is detected at G, this heralds the presence of an identical
photon in the Mach-Zehnder interferometer. By varying the position of one of
the mirrors as shown, the path length of one arm can be changed relative to the
other. When the path difference is equal to a multiple of the wavelength of the
light, the photon interferes constructively with itself at the second
beamsplitter. When the path length is changed by an additional $\lambda/2$, the
interference is destructive. Grangier et al. (1986) first performed this
experiment using photon pairs generated by fluorescence from an atomic beam of
calcium using 2-photon excitation, before down-conversion using BBO was
recognized as a superior source. The results of Grangier et al. are shown in
Fig. 2.16.

The plots in Fig. 2.16 show the number of photon counts as a function of the
change in the interferometer path length $\delta L$. The groups (a), (b), and (c)
correspond to increasing integration times. In Fig. 2.16a the integration time is
short enough that the recordings of individual photons detected at T or at R are
clearly seen. In Fig. 2.16c, the integration time is long enough that a clear
pattern, corresponding to wave interference, is seen. As expected, when
the detector T receives a maximum signal, detector R is at a minimum and vice
versa. In the intermediate regions where the signals at T and R are comparable,
the single photon has similar probabilities for transmission to T or to R. It is
important to remember that this cannot correspond to the classical interpretation
that the intensity of the wave is divided by the beam splitter, because a single
photon cannot be detected more than once. In the single photon case, all the
intensity is collected by either T or R. This behavior is summarized by the
necessity of wavefunction collapse upon measurement, which is an important
component of the Copenhagen interpretation of quantum mechanics.

Fig. 2.16 Recording of single photon arrivals for three different detection times; (a), (b) and (c). In each
case, the horizontal axis corresponds to the length of travel in the interferometer. At any specific length, the
detection amplitudes for the detector T are perfectly anticorrelated with those for the detector R (reproduced
from Grangier et al. (1986) with kind permission of Europhysics Letters, EDP Sciences, 17, av. du Hoggar,
91944 Les Ulis, France, www.edpsciences.org)
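
The complementary behavior of the two outputs can be reproduced with a two-mode model of the interferometer. The sketch below is an idealization: it assumes lossless, symmetric 50/50 beam splitters represented by the matrix $\frac{1}{\sqrt{2}}\begin{pmatrix}1 & i\\ i & 1\end{pmatrix}$ and a phase $\phi = 2\pi\,\delta L/\lambda$ added to one arm. Which output port corresponds to detector T or R depends on the geometry, so the ports are simply labeled 0 and 1. The wavelength in the usage example and all function names are hypothetical.

```python
import numpy as np

# Symmetric lossless 50/50 beam splitter acting on the two path amplitudes.
BS = np.array([[1, 1j], [1j, 1]]) / np.sqrt(2)

def mach_zehnder_probabilities(delta_l, wavelength):
    """Detection probabilities at output ports 0 and 1 for one photon entering port 0."""
    phi = 2 * np.pi * delta_l / wavelength
    phase = np.array([[np.exp(1j * phi), 0], [0, 1]])  # extra path length in arm 0
    out = BS @ phase @ BS @ np.array([1, 0], dtype=complex)
    return np.abs(out) ** 2

def simulate_clicks(delta_l, wavelength, n_photons, seed=0):
    """Each photon gives exactly one click: at port 0 with probability p0, else at port 1."""
    rng = np.random.default_rng(seed)
    p0 = float(np.clip(mach_zehnder_probabilities(delta_l, wavelength)[0], 0.0, 1.0))
    clicks0 = int(rng.binomial(n_photons, p0))
    return clicks0, n_photons - clicks0

if __name__ == "__main__":
    lam = 810e-9  # hypothetical wavelength
    for frac in (0.0, 0.125, 0.25, 0.375, 0.5):
        print(frac, mach_zehnder_probabilities(frac * lam, lam),
              simulate_clicks(frac * lam, lam, 100))
```

The probabilities vary as $\sin^2(\phi/2)$ and $\cos^2(\phi/2)$, so a maximum at one port coincides with a minimum at the other. With few photons the click counts are noisy, and the sinusoidal pattern emerges only as the number of photons grows.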

2.4.4 The Copenhagen Interpretation of Quantum Measurements

The Copenhagen interpretation is a set of guidelines for understanding and
interpreting experiments on quanta, and for connecting theoretical calculations
with experiment. It was developed mainly by Bohr and Heisenberg in the first
half of the twentieth century. For many years afterward, these guidelines
provided a useful framework for communication among physicists, acting like the
“holy tablets” passed down by the “founding fathers” of the quantum revolution.
However, there is as yet no theory to explain the physics of wavefunction
collapse.

The progress in instrumentation during the last two decades now enables
“table-top” experiments that measure the behavior of single quanta. The group of
Jan-Peter Meyn at the Universität Erlangen-Nürnberg has created on-line
versions of key experiments in quantum photonics. The user can pilot the
measurement, record and analyze data
(http://www.didaktik.physik.uni-erlangen.de/quantumlab/english/index.html). In
these circumstances, physicists around the world can make measurements on the
most basic of all elementary particles: the photon. This is an area of very
exciting research, the results of which are addressing questions of what
constitutes knowledge and reality.

Current experimental results in this field are causing a re-examination of the
Copenhagen interpretation. One objective is to replace the Copenhagen “rules”
with physical theory that explains observed behavior. We will not attempt to
review the directions of current thought here, but the reader is directed to the
Wikipedia page on the Copenhagen interpretation
(http://en.wikipedia.org/wiki/Copenhagen_interpretation), which gives an idea of
the range of the current debate.


2.5 Quantum Entanglement 

Measurements of more than one photon can be divided into two categories. In
the classic case photons behave independently. Photons that pass through the
double-slit apparatus, for example, usually do so independently, and do not
interfere with each other. Each photon does interfere with itself. Such photons
are called separable quanta. If we can write the states of two such quanta as
3-dimensional vectors:

    $|A\rangle = (a_1|i_1\rangle,\ a_2|i_2\rangle,\ a_3|i_3\rangle)$ and $|B\rangle = (b_1|j_1\rangle,\ b_2|j_2\rangle,\ b_3|j_3\rangle)$

then the combined state of $|A\rangle$ and $|B\rangle$ is written as a tensor. If
this can be accomplished by a straightforward multiplication:

    $|\Psi\rangle = |A\rangle \otimes |B\rangle = \begin{pmatrix} a_1 b_1 |i_1, j_1\rangle & a_1 b_2 |i_1, j_2\rangle & a_1 b_3 |i_1, j_3\rangle \\ a_2 b_1 |i_2, j_1\rangle & a_2 b_2 |i_2, j_2\rangle & a_2 b_3 |i_2, j_3\rangle \\ a_3 b_1 |i_3, j_1\rangle & a_3 b_2 |i_3, j_2\rangle & a_3 b_3 |i_3, j_3\rangle \end{pmatrix}$    (2.30)

then the constituent wavefunctions can be measured independently and are
separable.






However, this is only one of the ways to write the matrix. In general the basis
states i and j are not equivalent and the result of the combination is different. If
the combinations are made in any other way, then the wavefunctions cannot be
measured independently, and the wavefunctions are said to be entangled.

The case of photon polarization is important because this observable is used
to characterize photon quantum effects.

We can express the polarization state of each photon as a combination of
parallel and perpendicular states:

    $|A\rangle = a_1|\uparrow\rangle + a_2|\rightarrow\rangle$ and $|B\rangle = b_1|\uparrow\rangle + b_2|\rightarrow\rangle$

A combined but separable wavefunction is written:

    $|\Psi\rangle = |A\rangle \otimes |B\rangle = a_1 b_1|\uparrow\uparrow\rangle + a_1 b_2|\uparrow\rightarrow\rangle + a_2 b_1|\rightarrow\uparrow\rangle + a_2 b_2|\rightarrow\rightarrow\rangle$    (2.31)

On the other hand, the combined wavefunction, with zero net polarization:

    (2.32)

is entangled, because it cannot be expressed as the product matrix shown in
(2.30).
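
Whether a two-photon state can be written as a product can be tested numerically. The sketch below uses a standard criterion that is not taken from the text: the coefficients $c_{mn}$ of $|\Psi\rangle = \sum_{mn} c_{mn}|m\rangle|n\rangle$ form a matrix, a product state $a_m b_n$ gives a matrix of rank one, and any higher rank (the Schmidt rank) signals entanglement. As an example of an entangled state, the code assumes the antisymmetric combination $(|\uparrow\rightarrow\rangle - |\rightarrow\uparrow\rangle)/\sqrt{2}$, a commonly used state that is not necessarily the form given in (2.32). The function names and amplitudes are hypothetical.

```python
import numpy as np

def schmidt_rank(coefficients, tol=1e-12):
    """Number of non-negligible singular values of the coefficient matrix c[m, n]."""
    c = np.asarray(coefficients, dtype=complex)
    singular_values = np.linalg.svd(c, compute_uv=False)
    return int(np.sum(singular_values > tol))

def is_separable(coefficients, tol=1e-12):
    """A pure two-particle state is a product state exactly when its Schmidt rank is 1."""
    return schmidt_rank(coefficients, tol) == 1

if __name__ == "__main__":
    a = np.array([0.6, 0.8])                 # hypothetical amplitudes a_1, a_2
    b = np.array([1.0, 1.0]) / np.sqrt(2)    # hypothetical amplitudes b_1, b_2
    product_state = np.outer(a, b)           # c[m, n] = a_m * b_n
    antisymmetric = np.array([[0, 1], [-1, 0]]) / np.sqrt(2)
    print("product state separable:", is_separable(product_state))
    print("antisymmetric state separable:", is_separable(antisymmetric))
```

For the product state, the second singular value vanishes to within rounding error. For the antisymmetric state, both singular values equal $1/\sqrt{2}$, so no choice of single-photon amplitudes can reproduce it.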

When two (or more) quanta are entangled, then they behave as a single 
particle with respect to measurement of the entangled property. Thus the 
constituent quanta are correlated. A measurement on one of the constituent 
quanta becomes a simultaneous measurement on all the constituent quanta, even 
though they may appear to be spatially separated. Entanglement is closely 
related to the macroscopic concept of entropy. 

As an example, parametric down conversion creates a pair of entangled 
photons with zero net polarization. Prior to a measurement, the polarization of 
each photon is unknown. However, the net polarization of the pair is known. A 
measurement of polarization on one of the photons must indicate either parallel 
or perpendicular orientation of the polarization. As a result, the other 
unmeasured photon must assume the opposite orientation in order to conserve 
the feature that the pair has zero net polarization. Such an event represents a 
transition from the quantum mechanical world where particles may exist in a 
superposition of possible states to the classical world where particles must 
assume specific values. 






2.6 Summary 

The photon is an elementary particle characterized by its frequency and 
polarization. The photon displays both wave-like and particle-like behavior. 
Maxwell’s equations describe the electromagnetic fields in the classical regime, 
including propagation of photon beams as a function of frequency and 
polarization. Maxwell’s equations represent a staggering accomplishment in 
physical theory, but they do not give a complete description of photon behavior. 
It was subsequently shown by Planck that the energy carried by a photon beam 
radiated from a body heated to a temperature T could be properly explained only 
if each photon were carrying a quantum of energy proportional to its frequency. 
Maxwell’s equations are often not well adapted for describing the behavior of 
the photon as an individual quantum. 

Single photon behavior can be studied by using spontaneous parametric
down conversion to transform one photon into a pair of photons, each having
one-half the frequency of its parent. (Spontaneous parametric down-conversion
is a good example of photon behavior that cannot be accounted for
by Maxwell’s equations.) A basic tool for studying single photons is the
beamsplitter. The beamsplitter can be analyzed in the classical regime using
Maxwell’s equations, but this analytical approach fails when faced with the
propagation of a single photon through a beamsplitter apparatus. This failure,
however, does not prevent the single photon from propagating through the
beamsplitter. The photon wavefunction is seen to be present in both exit paths of
the beam splitter, and when reflecting mirrors are used to bring the paths to
overlap, interference (that is, self-interference) is recorded, with results that
echo Thomas Young’s demonstration of two-slit interference in the early
nineteenth century. This and related experiments show that the photon is a particle with
wavelike behavior, and that as a single “particle” it can be present over an 
extended macroscopic region of space. This kind of non-local behavior of the 
photon is a fundamental building block of important applications like quantum 
cryptography and quantum computing. 


Exercises 

2.1 Using MATLAB, calculate the transmission coefficient of a photon beam
through a plate of glass (SiO$_2$) using the parameters and following the
procedure given in Example 2.1. Plot the dependence of the transmission
coefficient as a function of wavelength and as a function of frequency.
Identify the wavelength above which oscillations in the transmission
coefficient no longer occur.

2.2 Using Maxwell’s equations, derive the expression for the transmission 
coefficient and the reflection coefficient for an electromagnetic wave 
passing from one dielectric material to another with the electric field 
polarized parallel to the plane of incidence. These expressions are Fresnel’s 
3rd and 4th equations. 

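2.3 Derive the analytical expression for photon propagation through a plasma
layer when the wavevector is an imaginary number ($k_2 = i\kappa_2$, where
$\kappa_2$ is real).

Show that:

    $T = \frac{1}{\cosh^2(\kappa_2 a) + \frac{1}{4}\left(\frac{\kappa_2}{k_1} - \frac{k_1}{\kappa_2}\right)^2 \sinh^2(\kappa_2 a)}$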

2.4 A general interface between two planar regions having indices of refraction
$n_1$ and $n_2$ is shown in Fig. 2.17:

Fig. 2.17 Diagram of index of refraction versus distance for Exercise 2.4

Show that the general expression for the transfer matrix that summarizes the
boundary conditions at the interface at z = Z is:

    $M = \frac{1}{2}\begin{pmatrix} (1+R)e^{iZB} & (1-R)e^{-iZA} \\ (1-R)e^{iZA} & (1+R)e^{-iZB} \end{pmatrix}$

where

    $R = k_2/k_1$
    $A = k_1 + k_2$
    $B = k_2 - k_1$

$k_1$ is the wavevector on the left-hand side and $k_2$ is the wavevector on the
right side of the interface, and Z is the coordinate of the interface.

2.5 Compute the transmission coefficient for photons passing through a grating
formed by 5 planar layers of alternating index of refraction, making use of
the expression in Exercise 2.4 for the transfer matrix (Fig. 2.18).

Fig. 2.18 Model structure for a five-layer planar dielectric stack


Evaluate your computing routine using:

$n_1 = 1.5$
$n_2 = 1.9$

Thickness of each region = 500 nm

Over a frequency range: $10^{14}$ Hz $< \omega < 10^{15}$ Hz

Make a plot of the transmission and reflection coefficients as a function of
free-space wavelength.

Hint: write a subroutine that determines the transfer matrix for two adjacent
interfaces of the structure.

2.6 Calculate the transmission coefficient as a function of free-space
wavelength for photons passing through a dielectric stack of 11 planar
layers of alternating index of refraction, making use of the calculation
routine developed in Exercise 2.4. Refer to Fig. 2.19.

(a) Show that the effect of adding additional layers is to create bands of
transmission and reflection.

(b) Demonstrate which parameters of the calculation determine the width of
the transmission bands

(c) What is the maximum value of reflectivity that you can achieve?

Fig. 2.19 The 11-period structure. The index of refraction is changed periodically, simulating a
multi-layer dielectric filter


Evaluate your simulation using

$n_1 = 1.5$, width = 500 nm
$n_2 = 3.0$, width = 500 nm
500 nm $< \lambda_{\text{free-space}} <$ 1500 nm


Such a structure is called a photonic crystal, because it is a periodic structure,
like a crystal, but with a period that is similar to the wavelength of light that
propagates through the structure.

2.7 By introducing a defect in the structure of Fig. 2.19 we can change its
optical properties. There are different types of defects:

Vacancy: a layer can be left out of the stack

Impurity: a layer can have a different index of refraction

Phase: a layer can be placed at a non-periodic location

Each of these defects can be modelled in a straightforward way using the
transfer-matrix method. In this example we show the effect of introducing a
phase defect. We advance the position of the 6th layer in the stack by 30% of a
full period as shown in Fig. 2.20.

Fig. 2.20 Periodic dielectric contrast grating. The sixth period has been replaced by a phase defect

Referring to Fig. 2.18, introduce a defect in the 11 period structure by
enlarging the lower index layer in the center of the periodic structure.

Using the same parameters as those for Exercise 2.5, substitute a quarter-period
phase defect for the 6th period. In a simple form, this could be a section 250 nm
in length with $n = n_1$


(a) Calculate the transmission coefficient as a function of free-space
wavelength and compare to your result in Exercise 2.4

(b) Show that the presence of a wider layer introduces a region where
transmission can occur in the middle of a region where transmission is
normally forbidden. (This is called a defect mode, and is used to achieve
single-mode operation of lasers that use photonic crystal reflectors as
mirrors.) (see Fig. 2.21 for an example simulation)

2.8 Some data taken from the experiment “Existence Photon” on the Quantum 
Lab website and described in Fig. 2.14 are summarized in Table 2.1. 
Detector D1 refers to the transmitted photon and detector D2 refers to the 
reflected photon. T refers to the trigger which heralds the arrival of a single 
photon at the beam splitter. These data are taken for a series of laser powers
used to generate photon pairs by spontaneous parametric down conversion
(Fig. 2.21)


Table 2.1 Data for the passage of a single photon passing through a beam splitter

| Laser power (µW) | 500.000 | 400.000 | 300.000 | 200.000 | 100.000 | 50.000 |
|---|---|---|---|---|---|---|
| Counts D1 | 971,650.700 | 781,706.300 | 578,977.667 | 384,096.367 | 193,297.533 | 96,938.467 |
| Error D1 | 1928.581 | 1295.531 | 958.530 | 577.874 | 409.237 | 324.212 |
| Counts D2 | 960,647.500 | 764,864.200 | 560,194.467 | 369,650.367 | 185,951.100 | 93,124.300 |
| Error D2 | 1455.367 | 964.423 | 608.047 | 604.612 | 516.928 | 338.532 |
| Counts T | 2,270,792.800 | 1,793,066.333 | 1,302,454.000 | 856,532.200 | 429,910.333 | 214,817.333 |
| Error T | 4131.687 | 2195.611 | 1307.544 | 1099.080 | 715.257 | 479.135 |
| Counts D1 and T | 221,116.333 | 178,162.100 | 131,663.400 | 87,088.800 | 43,596.467 | 21,692.967 |
| Error D1 and T | 713.366 | 491.231 | 454.170 | 265.738 | 200.273 | 145.135 |
| Counts D2 and T | 221,075.567 | 176,539.133 | 129,278.733 | 85,011.100 | 42,624.267 | 21,126.767 |
| Error D2 and T | 628.030 | 508.930 | 414.670 | 317.893 | 220.700 | 134.916 |
| Counts D1 and D2 | 971.133 | 632.467 | 344.633 | 149.967 | 36.267 | 10.067 |
| Error D1 and D2 | 29.586 | 24.787 | 20.830 | 15.721 | 6.236 | 3.552 |
| Counts D1 and D2 and T | 391.900 | 256.467 | 141.033 | 59.500 | 15.133 | 4.033 |
| Error D1 and D2 and T | 20.444 | 15.885 | 12.971 | 9.149 | 3.875 | 2.251 |
| $g^{(2)}(0)$ without trigger | | 1.024 | 1.028 | 1.022 | 0.976 | |
| Error $g^{(2)}(0)$ without trigger | 0.031 | 0.040 | 0.062 | 0.107 | 0.168 | 0.381 |
| $g^{(2)}(0)$ with trigger | | 0.015 | 0.011 | 0.007 | 0.004 | |
| Error $g^{(2)}(0)$ with trigger | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 |
| Measure-Time (s) | 60.000 | 60.000 | 60.000 | 60.000 | 60.000 | 60.000 |
| Number of measurements | 30.000 | 30.000 | 30.000 | 30.000 | 30.000 | 30.000 |


The experiment set-up is shown in Fig. 2.14. D1 corresponds to
transmission, while D2 corresponds to reflection. There is a “trigger”
detector T that heralds the presence of a single photon at the beam splitter.
These data are extracted from the website: Quantum Lab,
http://www.didaktik.physik.uni-erlangen.de/quantumlab/english/index.htm,
“Existence Photon” and used with the kind permission of Prof. Jan-Peter Meyn




























Fig. 2.21 Sample simulation of defect in a dielectric stack. This result shows the effect of a
quarter-period phase defect inserted in a dielectric stack. The defect creates a highly transmitting, narrow
bandwidth mode in the center of a highly reflecting (R > 0.9999) band


(a) Complete the table by calculating the second-order correlation
coefficient $g^{(2)}(0)$ for the measurements with trigger and without trigger
for laser power = 500 and 50 pW.

(b) Comment on the relationship between the determination of $g^{(2)}(0)$ and
laser power that you observe in the two cases, with trigger and without
trigger.

Abstract

Solutions to the Schrödinger wave equation are used to study the allowed states
of electrons and holes in the presence of a one-dimensional potential. In this
analysis, we treat the electron as a wave with a single wavevector. We study the
transit of an electron-wave through a series of potential barriers using the
transfer-matrix method to determine the transmission coefficient as a function of
kinetic energy. The Schrödinger equation shows that an electron can be
transported through a potential barrier when its kinetic energy is not sufficient to
surmount the potential barrier. This tunneling effect is an important contributor
to reverse breakdown in p-n junction diodes. Tunneling behavior is more
prominent in the transport of electrons through a sequence of potential barriers.
As the number of barriers increases, the range of allowed energies for tunneling
broadens into a transmission band. These bands are directly related to the
electronic bandstructure of crystalline materials. Particle-like behavior of
electrons can be described by constructing a wavepacket. The electron
wavefunction consists of a sum of different wavevectors leading to localization
in space and time of the electron wavefunction. This representation is needed to
address transit time between two points in space.


3.1 Introduction 




Crystalline semiconductor materials are typically grown as a series of epitaxial
layers on a plane, single-crystal substrate. The thickness of the layers can be
controlled accurately from a nanometer to several micrometers. While charge
carriers (electrons and holes) are largely free to move in the plane of these layers,
transport in the third dimension is modified strongly by differences in the band gaps
of the layers composing the structure, leading to charge carrier confinement,
tunneling, and local acceleration, among some of the more interesting effects.
Charge carrier confinement is the essential component of the silicon MOSFET,
the room-temperature semiconductor injection laser, and the quantum Hall effect.

In this chapter, we will study the allowed states of electrons and holes in the 
presence of one-dimensional potentials corresponding to the different bandgaps 
in the structures under consideration. In many instances we will treat the electron 
as a wave with a single wavevector. Solutions to the Schrödinger wave equation
are quite accurate, to the point that they are now used as the tool of choice for 
design and analysis of semiconductor devices that depend on energy-band 
heterostructures for their operation. This analysis is called bandstructure 
engineering. 

Particle-like behavior of electrons can be described by constructing a 
wavepacket. The electron wavefunction consists of a sum of different 
wavevectors leading to localization in space and time of the electron 
wavefunction. This representation is needed to address transit time between two 
points in space. 


3.2 The Schrödinger Equation

Following the experimental verification of de Broglie’s proposal that electrons
(and holes) have a characteristic wavelength, we can write down the wave
function of a one-dimensional free electron as follows:

$$\Psi(x) = A e^{ikx} \tag{3.1}$$

The time-independent Schrödinger equation was presented in (1.17):

$$-\frac{\hbar^2}{2m^*}\frac{d^2}{dx^2}\Psi(x) + V(x)\,\Psi(x) = E\,\Psi(x) \tag{3.2}$$

Schrödinger’s equation is a statement of conservation of total energy:

Kinetic energy + Potential energy = Total energy

The solutions of Schrödinger’s equation give the stationary states for which
total energy is conserved.




3.3 Quantized Energy Levels for Electrons in an Infinitely Deep Potential Well

3.3.1 Potential Well with Infinitely High Boundaries

A simple point of departure is the case of energy levels for electrons confined in
a potential well with infinite potential barriers on each side (Fig. 3.1):

Fig. 3.1 Particle energy versus real-space distance in one dimension. The width of the well is given by L.
The energy at the bottom of the well is set to 0 eV. The solution to the Schrödinger equation is a
free-electron wavefunction. The particle wavevector k is imaginary in the barrier region and a real number in the
well region

The general solution to (3.2) in each region has the form of two
counter-propagating plane waves:

$$\Psi(x) = A e^{ikx} + B e^{-ikx}, \quad \text{where } k = \frac{\sqrt{2m^*(E - V)}}{\hbar} \tag{3.3}$$

The constants A and B are determined by application of boundary conditions.














3.3.2 Boundary Conditions 

The boundary conditions are an essential input to the physics of the problem. We 
will treat them region by region. 

Region I: $\Psi(x) = A e^{ikx} + B e^{-ikx}$

In region I, V is a positive real number, so k is purely imaginary. This means
that the wavefunction consists of 2 terms, one that grows exponentially and the
other which decays exponentially. The exponentially growing term is unphysical,
so the coefficient A = 0.

Because the value of V is infinite, the remaining term is 0 everywhere, and B
remains undetermined. $\Psi(x) = 0$, for all $x \le -\frac{L}{2}$.

Region III: $\Psi(x) = F e^{ikx} + G e^{-ikx}$

The boundary condition applied to region III is similar to that for region I.
The coefficient F is set to zero and the remaining term is also 0 because the
value of V is infinite. G remains undetermined. $\Psi(x) = 0$, for all $x \ge \frac{L}{2}$.

Region II: $\Psi(x) = C e^{ikx} + D e^{-ikx}$

In region II, V = 0, and $E = \frac{\hbar^2 k^2}{2m^*}$. The boundary condition is continuity of the
wavefunction across the boundaries. This means
$\Psi\left(-\frac{L}{2}\right) = \Psi\left(\frac{L}{2}\right) = 0$ and requires
that $kL = n\pi$, n = 1, 2, 3...

It then follows that $C = \pm D$ (C = D for odd n, C = −D for even n) in order to
assure that the wavefunction is zero at the boundaries of the well.


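Solution:

For $x < -\frac{L}{2}$ and $x > \frac{L}{2}$: $\Psi(x) = 0$.

For $-\frac{L}{2} < x < \frac{L}{2}$, k is quantized:

$$k = \frac{n\pi}{L} \tag{3.4}$$

The quantized energy levels are given by

$$E_n = \frac{n^2 \pi^2 \hbar^2}{2 m^* L^2} \tag{3.5}$$

The lowest-lying solution, for n = 1, is $\Psi(x) = C\cos\left(\frac{\pi}{L}x\right)$.

The value for C can be determined by normalizing the wavefunction to unity
in the well:

$$\int_{-L/2}^{L/2} C^2 \cos^2\left(\frac{\pi}{L}x\right) dx = 1, \quad C = \sqrt{\frac{2}{L}} \tag{3.6}$$

For n = 2, the wave function is $\Psi(x) = \sqrt{\frac{2}{L}}\sin\left(\frac{2\pi}{L}x\right)$.

For $|x| > \frac{L}{2}$, $\Psi(x) = 0$.

In general the wavefunctions are:

$$\Psi(x) = \begin{cases} \sqrt{\frac{2}{L}}\cos\left(\frac{n\pi}{L}x\right), & n = 1, 3, 5, \ldots \\ \sqrt{\frac{2}{L}}\sin\left(\frac{n\pi}{L}x\right), & n = 2, 4, 6, \ldots \end{cases} \tag{3.7a, b}$$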


The first three quantized energy levels are shown in Fig. 3.2. These are
calculated using (3.5), with a well width of 10 nm and using an effective mass of
0.066 $m_0$, which is that of an electron in the conduction band of GaAs.
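
As a quick numerical check of (3.5), the following minimal Python sketch evaluates the first three levels for the 10 nm well. The function name and the rounded values of the physical constants are choices made here for illustration; they are not taken from the text.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M0 = 9.1093837015e-31    # free-electron rest mass, kg
Q = 1.602176634e-19      # elementary charge, C (also J per eV)


def infinite_well_levels(width_m, m_eff_ratio, n_max=3):
    """Return E_n in eV from E_n = n^2 pi^2 hbar^2 / (2 m* L^2)."""
    m_eff = m_eff_ratio * M0
    e1_joule = (math.pi * HBAR) ** 2 / (2.0 * m_eff * width_m ** 2)
    return [n * n * e1_joule / Q for n in range(1, n_max + 1)]


# 10 nm GaAs well with m* = 0.066 m0, as in the text
levels = infinite_well_levels(10e-9, 0.066)
for n, energy in enumerate(levels, start=1):
    print(f"E_{n} = {energy:.3f} eV")
```

With these constants the ground state comes out near 0.057 eV and the higher levels scale as n². The entries for the infinite well in Table 3.1 are a few percent lower, which may simply reflect a slightly different effective mass or rounding.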



Fig. 3.2 Quantized energy levels in a well having a width of 10 nm and infinitely high potential barriers.
The electron effective mass = 0.066 $m_0$









3.4 Electrons Confined by a Square Potential Well with 
Finite Barriers 

The previous analysis of the square potential well is helpful in understanding 
some general behavior of confined electrons in semiconductor heterostructures. 
However, real heterostructure devices must be designed with finite barriers. Both 
the analysis and the results have significant differences compared to the infinite 
potential model. In addition, the finite potential barrier model that we present 
here gives a precise and quantitative account of electronic energy levels in 
important semiconductor devices such as lasers and modulators. 

As illustrated in Fig. 3.3, the potential barrier is defined:

$$V(x) = \begin{cases} V_0, & |x| \ge \frac{L}{2} \\ 0, & |x| < \frac{L}{2} \end{cases} \tag{3.8}$$

Fig. 3.3 Diagram of a potential well with finite barrier height. The parameters for electron effective mass
and barrier height are specific for the heterostructure: GaAs/Al$_{0.3}$Ga$_{0.7}$As

Schrödinger’s equation is:

$$-\frac{\hbar^2}{2m^*}\frac{d^2}{dx^2}\Psi(x) + V(x)\,\Psi(x) = E\,\Psi(x) \tag{3.9}$$

where $m^* = m_B^*$ in the barrier material, such as Al$_x$Ga$_{1-x}$As, and $m^* = m_W^*$ in the
well material, such as GaAs.

We are principally interested in the range of electron energies $E < V_0$, where
electrons are confined by the potential well. As before, we can write the solution
to (3.9) in the three different regions.


3.4.1 Boundary Conditions 

There are 2 boundary conditions which are necessary to assure that the physics 
of the situation is respected: 


(i) continuity of the wavefunction across the boundary. 


(ii) continuity of the electric current across the boundary. 

This second condition arises because electrons are not confined entirely to 
the well, due to the finite nature of the confining potential. Continuity of current 
is a physical requirement that assures conservation of charge throughout the 
heterostructure. 

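The current carried by one electron is:

$$I = -qv = -q\frac{m^* v}{m^*} = -q\frac{\hbar k}{m^*} = iq\frac{\hbar}{m^*}\frac{d}{dx}\Psi(x)$$

The boundary condition that conserves electric current is therefore:

$$\frac{1}{m_W^*}\frac{d}{dx}\Psi_W(x)\Big|_{x=\pm\frac{L}{2}} = \frac{1}{m_B^*}\frac{d}{dx}\Psi_B(x)\Big|_{x=\pm\frac{L}{2}} \tag{3.10}$$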

D.J. BenDaniel and C.B. Duke proposed and explained this requirement in a 
landmark publication. Nearly twenty years later, G. Bastard showed the 
importance of these boundary conditions for the physics of semiconductor 
heterostructures (BenDaniel and Duke 1966; Bastard 1981). 

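Applying these conditions at $x = \frac{L}{2}$, where the wavefunction in the barrier decays as
$F e^{-\kappa x}$ with $\kappa = \frac{1}{\hbar}\sqrt{2m_B^*(V_0 - E)}$, and $k = \frac{1}{\hbar}\sqrt{2m_W^* E}$ in the well:

$$C e^{ik\frac{L}{2}} + D e^{-ik\frac{L}{2}} = F e^{-\kappa\frac{L}{2}}$$

and

$$\frac{ik}{m_W^*}\left(C e^{ik\frac{L}{2}} - D e^{-ik\frac{L}{2}}\right) = -\frac{\kappa}{m_B^*} F e^{-\kappa\frac{L}{2}}$$

These results can be combined:

$$\frac{ik}{m_W^*}\left(C e^{ikL} - D\right) = -\frac{\kappa}{m_B^*}\left(C e^{ikL} + D\right), \tag{3.11}$$

giving a homogeneous linear equation for the unknowns C and D.

$$C e^{ikL}\left(\kappa m_W^* + ik m_B^*\right) + D\left(\kappa m_W^* - ik m_B^*\right) = 0$$

Applying the same procedure at the boundary $x = -\frac{L}{2}$ gives:

$$C e^{-ikL}\left(\kappa m_W^* - ik m_B^*\right) + D\left(\kappa m_W^* + ik m_B^*\right) = 0 \tag{3.12}$$

A solution to (3.11) and (3.12) exists if and only if the determinant of
coefficients is equal to zero:

$$2i\left(\kappa m_W^*\right)^2 \sin(kL) + 4ik m_B^* \kappa m_W^* \cos(kL) - 2i\left(k m_B^*\right)^2 \sin(kL) = 0$$

$$\cot(kL) = \frac{\left(k m_B^*\right)^2 - \left(\kappa m_W^*\right)^2}{2 k m_B^* \kappa m_W^*} \tag{3.13}$$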
Equation (3.13) gives the energy of the quantized states in the potential well. 
It can be solved by numerical iteration to any degree of accuracy desired. It is 
helpful to view the overall situation graphically as shown in Fig. 3.4. As is often 
the case, solving for the wavefunction is of secondary interest. 
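
The numerical solution can be sketched in a few lines of Python. This is a minimal illustration, not the procedure used to produce Fig. 3.4. It assumes $k = \sqrt{2m_W^*E}/\hbar$ in the well and $\kappa = \sqrt{2m_B^*(V_0 - E)}/\hbar$ in the barrier. To avoid the divergences of the cotangent, it looks for zeros of the equivalent determinant condition $\left[(\kappa m_W^*)^2 - (k m_B^*)^2\right]\sin(kL) + 2 k m_B^* \kappa m_W^* \cos(kL) = 0$, using a coarse scan followed by bisection. The function names, grid size, and tolerance are hypothetical choices.

```python
import math

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
Q = 1.602176634e-19      # J per eV


def determinant(energy_ev, v0_ev, width_m, m_well, m_barrier):
    """Determinant condition divided by 2i; its zeros are the bound states."""
    k = math.sqrt(2.0 * m_well * M0 * energy_ev * Q) / HBAR
    kappa = math.sqrt(2.0 * m_barrier * M0 * (v0_ev - energy_ev) * Q) / HBAR
    a = kappa * m_well       # kappa * m_W (masses in units of m0)
    b = k * m_barrier        # k * m_B
    kl = k * width_m
    return (a * a - b * b) * math.sin(kl) + 2.0 * a * b * math.cos(kl)


def bound_states(v0_ev, width_m, m_well, m_barrier, steps=4000, tol=1e-9):
    """Scan 0 < E < V0 for sign changes, then refine each root by bisection."""
    def f(e):
        return determinant(e, v0_ev, width_m, m_well, m_barrier)

    grid = [v0_ev * (i + 0.5) / steps for i in range(steps)]
    roots = []
    for lo, hi in zip(grid, grid[1:]):
        f_lo, f_hi = f(lo), f(hi)
        if f_lo * f_hi > 0.0:
            continue                     # no sign change in this interval
        while hi - lo > tol:
            mid = 0.5 * (lo + hi)
            f_mid = f(mid)
            if f_lo * f_mid <= 0.0:
                hi = mid
            else:
                lo, f_lo = mid, f_mid
        roots.append(0.5 * (lo + hi))
    return roots


# GaAs well (m_W = 0.066), barrier m_B = 0.092, V0 = 0.25 eV, L = 10 nm
for n, e in enumerate(bound_states(0.25, 10e-9, 0.066, 0.092), start=1):
    print(f"E_{n} = {e:.3f} eV")
```

For the parameters of the finite well in Table 3.1 this sketch should return three confined states close to the tabulated values.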


3.4.2 Discussion of Results for Quantization of Energy 
Levels in a Square Potential Well 

The results for the case of an electron confined in an infinite potential well are
intuitive. Since we treat the electron as a wave, then it follows that the allowed
states will be those for which an integral number of half-wavelengths can fit between
the geometrical boundaries of the potential well. The mathematics of this model
system corresponds to the case of sound waves in an organ pipe or the vibrations
of a guitar string. The energy of a free electron is proportional to the square of its
momentum, and from the de Broglie relation, this implies that the energy is
proportional to the inverse square of the wavelength: $E \propto p^2 = \left(\frac{h}{\lambda}\right)^2$. Thus, the
quantized energy levels increase in proportion to $n^2$ = 1 : 4 : 9, etc. The solution
to the equations for the model system of the square potential well of infinite
height is straightforward. The results, however, do not give a good
approximation to those for electron confinement by real potential wells, such as
those used to make semiconductor lasers.

In a second case, we have considered electron states confined by a square
potential well of finite height. A significant feature of this system is the
penetration of the electron states into the barrier. One obvious consequence is 
that the physical dimension of confinement is larger than the physical dimension 
of the potential well. This causes the quantized energies to be lower than those 
found for the case of the infinite potential well. A second important result is that 
the penetration of the electron wave function in the barrier region requires 
boundary conditions that properly reflect the physics of the situation. 

In Table 3.1, we summarize the calculation of the energy of the confined 
states in the two cases, for a potential well having a width of 10 nm. 

Table 3.1 Comparison of the first three quantized energy levels for electrons in GaAs, confined in a
potential well of width 10 nm

                V_0        m*_B (m_0)   m*_W (m_0)   E_1 (eV)   E_2 (eV)   E_3 (eV)
Infinite well   ∞          -            0.066        0.056      0.223      0.502
Finite well     0.25 eV    0.092        0.066        0.03       0.121      0.245

In the case of an infinitely high potential barrier, the quantized energies are
significantly larger than in the example of a finite barrier height based on
Al$_{0.3}$Ga$_{0.7}$As as the barrier material

Note that the quantized energies for the case of the infinite well follow the
sequence: $E_0$, $4E_0$, $9E_0$. For the finite well the series is: $E_0$, $4E_0$, $8.2E_0$. This
illustrates that the highest energy state is not as strongly confined by the
potential barrier. Indeed, it lies only a few meV below the barrier height. Weaker 
confinement means that the wave function extends farther into the barrier region 
than is the case for the n = 1 and n = 2 states. This lowers the confinement 
energy. 

The finite barrier model presented here has been shown to give accurate 
quantitative results for both electron and hole energy states in a wide variety of 
semiconductor heterostructures. By measuring the energies, one can determine 
the physical width of the well, and this calculated width can be compared to 
direct measurement by electron microscopy. The agreement between calculation 
and experiment confirms that electronic states are free-electron-like and that the 
boundary conditions used to solve Schrödinger’s equation are correct.

3.4.3 Energy Levels in a Quantum Well in the Presence 
of an Electric Field: The Quantum-Confined Stark Effect 

The application of an electric field (Stark effect) changes the relative energy
separation between adjacent quantum levels (Fig. 3.5). The energy shift can be
calculated using perturbation theory.

The Hamiltonian of the perturbation is expressed:

$$H' = q\mathcal{E}x \tag{3.14}$$

We will consider the modification to the lowest-lying (n = 1) level. The
first-order change is expressed

$$\Delta E_1 = \langle \Psi_1 | q\mathcal{E}x | \Psi_1 \rangle = 0$$

by a simple symmetry argument. Thus, the perturbation to the energy levels
must be calculated in the 2nd order

$$\Delta E_1 = \sum_{j \ne 1} \frac{\left|\langle \Psi_j | q\mathcal{E}x | \Psi_1 \rangle\right|^2}{E_1 - E_j} \approx \frac{\left|\langle \Psi_2 | q\mathcal{E}x | \Psi_1 \rangle\right|^2}{E_1 - E_2} \tag{3.15}$$

where we have simplified the calculation by keeping only the first term of 
the sum, that is: j = 2. There are good reasons for this: 


1. A quantum well in photonic materials rarely has more than 3 levels. 

2. The magnitude of subsequent terms will be reduced by the larger energy 
level difference. 

3. The matrix element is zero by symmetry whenever j is an odd number. (This 
is demonstrated in Sect. 7.5.2) 

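We can evaluate (3.15) in the case of the infinitely deep quantum well using
the wavefunctions obtained in (3.7a, b), in order to estimate the magnitude of the
effect.

$$\Delta E_1 \approx \frac{q^2\mathcal{E}^2\left|\frac{2}{L}\int_{-L/2}^{L/2} \sin\left(\frac{2\pi}{L}x\right)\, x\, \cos\left(\frac{\pi}{L}x\right) dx\right|^2}{E_1 - E_2} = -\left(\frac{16}{9\pi^2}\right)^2 \frac{2 q^2 \mathcal{E}^2 m^* L^4}{3\pi^2\hbar^2} \tag{3.16}$$

For a quantum well of GaAs having a width of 10 nm, the shift in energy is
about 2 meV in an electric field of $10^5$ V cm$^{-1}$ and is shown in Fig. 3.5.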
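
The estimate of about 2 meV can be checked numerically. The sketch below is a minimal illustration under the same assumptions as the text (infinite well, only the j = 2 term of the second-order sum retained). It integrates the matrix element $\langle\Psi_2|x|\Psi_1\rangle$ with a simple midpoint rule instead of using the closed-form value $16L/(9\pi^2)$; the function names and the number of sample points are arbitrary choices made here.

```python
import math

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
Q = 1.602176634e-19      # C


def dipole_matrix_element(width_m, samples=20000):
    """<Psi_2| x |Psi_1> for the infinite well, by the midpoint rule (in m)."""
    dx = width_m / samples
    norm = math.sqrt(2.0 / width_m)
    total = 0.0
    for i in range(samples):
        x = -0.5 * width_m + (i + 0.5) * dx
        psi1 = norm * math.cos(math.pi * x / width_m)
        psi2 = norm * math.sin(2.0 * math.pi * x / width_m)
        total += psi2 * x * psi1 * dx
    return total


def stark_shift_ev(width_m, m_eff_ratio, field_v_per_m):
    """Second-order shift of the n = 1 level, keeping only the j = 2 term."""
    m_eff = m_eff_ratio * M0
    e1 = (math.pi * HBAR) ** 2 / (2.0 * m_eff * width_m ** 2)   # joules
    e2 = 4.0 * e1
    x21 = dipole_matrix_element(width_m)
    shift_joule = (Q * field_v_per_m * x21) ** 2 / (e1 - e2)
    return shift_joule / Q


# 10 nm GaAs well, field of 1e5 V/cm = 1e7 V/m
print(f"{1e3 * stark_shift_ev(10e-9, 0.066, 1e7):.2f} meV")
```

The result is negative, meaning the ground state is pushed down in energy, and its magnitude is close to 2 meV.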


3.5 Quantum-Mechanical Tunneling 

In this section we will investigate the transmission of an electron or other
wavelike particle through a potential barrier of finite magnitude. This situation is
nearly the reverse of that discussed in the previous section. Wavelike behavior
permits particle transmission through a barrier even though its total energy is
less than that of the potential. This phenomenon has no classical analog.
Friedrich Hund was the first to take notice of tunneling in 1927 when he was
calculating the ground state of the double-well potential. Its first application was
a mathematical explanation for alpha decay, which was done in 1928 by George
Gamow and independently by Ronald Gurney and Edward Condon. They
simultaneously solved the Schrödinger equation for a model nuclear potential
and derived a relationship between the half-life of the particle and the energy of
emission that depended directly on the mathematical probability of tunneling.

The study of semiconductors and the development of transistors and diodes 
led to the acceptance of electron tunneling in solids by 1957. Leo Esaki 
demonstrated tunneling in pn-junction diodes for which he received the Nobel 
Prize in Physics in 1973. 

Tunneling of particles is an important event in many areas of science and 
technology, including black holes, elementary particle decay, superconductivity, 
radar, chemical reactions, and integrated circuits to name just a few. The 
scanning tunneling microscope created a new window of surface science 
allowing direct imaging in real space with atomic resolution. Gerd Binnig and 
Heinrich Rohrer were awarded the 1986 Nobel Prize for demonstrating this 
capability. Recently, tunneling has been harnessed and controlled to produce a 
new kind of semiconductor laser, based on tunneling injection of electrons into a 
square potential well. These devices are the subject of Chap. 8. The diagram of a 
one dimensional potential barrier is shown in Fig. 3.6. 

In this analysis, we suppose that a particle-wave is incident on the barrier
from the left. In the classical case, the particle is reflected with 100% probability
if its kinetic energy is less than the barrier height $V_0$. It will be transmitted with
100% probability if its kinetic energy is greater than $V_0$. In the quantum
mechanical case there will be both reflection and transmission of the
particle-wave at all energies.

The reflection coefficient is the probability that the particle is reflected:

$$R = \frac{|B|^2}{|A|^2} \tag{3.17}$$

where A and B are the amplitudes of the incident and reflected wavefunction
respectively as shown in Fig. 3.6. The transmission coefficient is the probability
that a particle incident in region I is transmitted to the right in region III (Fig.
3.7):

$$T = \frac{|F|^2}{|A|^2} \tag{3.18}$$

Fig. 3.4 Plot of cot(kL) (in blue) versus $\frac{(k m_B^*)^2 - (\kappa m_W^*)^2}{2 k m_B^* \kappa m_W^*}$ (in red). This calculation is based on a well
width of 10 nm and parameters for the well and barrier regions based on GaAs and Al$_{0.3}$Ga$_{0.7}$As
respectively. For the details, refer to Sect. 3.4.2































Fig. 3.5 Schematic diagram of a GaAs/AlGaAs quantum well in an electric field. An electric field of
$\mathcal{E} = 10^5$ V cm$^{-1}$ results in a voltage drop of 0.1 V across a 10 nm quantum well. The unperturbed
quantized energy levels are shown as solid lines. These are positioned in the quantum well with the
unperturbed energies ($E_1$ = 0.03 eV and $E_2$ = 0.121 eV) at the center of the well. The Stark effect pushes these
levels apart, and the perturbed energies are shown as dashed lines

The particle in the geometry of Fig. 3.6 is nowhere confined. As a result, a 
continuous range of particle energies is possible. We are interested to learn about 
the conditions for tunneling through the barrier. To do this we need to develop an 
expression for the transmission coefficient. As before we divide the analysis into 
three regions.

Fig. 3.6 Particle transmission in the presence of a potential barrier of width L and height $V_0$

Fig. 3.7 Transmission coefficient versus incident energy, for a free-electron incident on a potential barrier
of 0.25 eV. The barrier width is 10 nm. The electron mass is taken as the free-electron rest mass throughout.
The transmission coefficient remains less than 1% even as its energy exceeds the barrier height


Region I

The particle wave is incident from the left. In region I there will be both the
incident wave traveling to the right, and a reflected wave traveling to the left:

$$\Psi(x) = A e^{ikx} + B e^{-ikx} \tag{3.19}$$

The particle effective mass is $m_1^*$. The wavevector $k = \frac{1}{\hbar}\sqrt{2m_1^* E}$.

Region III

The situation is identical to that of Region I:

$$\Psi(x) = F e^{ikx} + G e^{-ikx} \tag{3.20}$$

Note that the term $G e^{-ikx}$ represents a traveling wave incident from the right
and moving to the left. In this tunneling model we could define G = 0. However,
there are some advantages to carrying this term to the end of the analysis in order to
develop a general solution that can be used with more complicated structures.

Region II

The wavefunction can be expressed in the same way:

$$\Psi(x) = C e^{ikx} + D e^{-ikx} \tag{3.21}$$

In this region, $k = \frac{1}{\hbar}\sqrt{2m_2^*(E - V_0)}$.

In the range of energies where tunneling occurs, k is a pure imaginary, so we
define:

$$k = i\kappa$$

Substituting in (3.21):

$$\Psi(x) = C e^{-\kappa x} + D e^{\kappa x} \tag{3.22}$$

We use the same two boundary conditions which are necessary to assure that 
the physics of the situation is respected: 

(i) continuity of the wavefunction across the boundary. 

(ii) continuity of the electric current across the boundary. 

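We apply the boundary conditions at each interface. For example, at the
interface between regions II and III:

$$\Psi_{\text{region II}}(x)\Big|_{x=\frac{L}{2}} = \Psi_{\text{region III}}(x)\Big|_{x=\frac{L}{2}} \tag{3.23}$$

and

$$\frac{1}{m_2^*}\frac{d}{dx}\Psi_{\text{region II}}(x)\Big|_{x=\frac{L}{2}} = \frac{1}{m_1^*}\frac{d}{dx}\Psi_{\text{region III}}(x)\Big|_{x=\frac{L}{2}} \tag{3.24}$$

In a similar fashion, we apply the boundary conditions at the interface at
$x = -\frac{L}{2}$:

$$A e^{-ik\frac{L}{2}} + B e^{ik\frac{L}{2}} = C e^{\kappa\frac{L}{2}} + D e^{-\kappa\frac{L}{2}}$$

and

$$A\frac{ik}{m_1^*} e^{-ik\frac{L}{2}} - B\frac{ik}{m_1^*} e^{ik\frac{L}{2}} = -C\frac{\kappa}{m_2^*} e^{\kappa\frac{L}{2}} + D\frac{\kappa}{m_2^*} e^{-\kappa\frac{L}{2}}$$

Next, we solve for A and B:

$$A = \frac{1}{2}e^{ik\frac{L}{2}}\left[\left(1 + \frac{i\kappa m_1^*}{k m_2^*}\right)e^{\kappa\frac{L}{2}}C + \left(1 - \frac{i\kappa m_1^*}{k m_2^*}\right)e^{-\kappa\frac{L}{2}}D\right]$$

$$B = \frac{1}{2}e^{-ik\frac{L}{2}}\left[\left(1 - \frac{i\kappa m_1^*}{k m_2^*}\right)e^{\kappa\frac{L}{2}}C + \left(1 + \frac{i\kappa m_1^*}{k m_2^*}\right)e^{-\kappa\frac{L}{2}}D\right] \tag{3.25}$$

This is convenient to write in matrix format:

$$\begin{pmatrix}A\\B\end{pmatrix} = \begin{pmatrix}M_1 & M_2\\M_3 & M_4\end{pmatrix}\begin{pmatrix}C\\D\end{pmatrix} \tag{3.26}$$

where $M_1, \ldots, M_4$ are the coefficients of C and D in (3.25).

Applying the same boundary conditions at the next interface gives:

$$C e^{-\kappa\frac{L}{2}} + D e^{\kappa\frac{L}{2}} = F e^{ik\frac{L}{2}} + G e^{-ik\frac{L}{2}} \tag{3.27}$$

and

$$-C\frac{\kappa}{m_2^*} e^{-\kappa\frac{L}{2}} + D\frac{\kappa}{m_2^*} e^{\kappa\frac{L}{2}} = F\frac{ik}{m_1^*} e^{ik\frac{L}{2}} - G\frac{ik}{m_1^*} e^{-ik\frac{L}{2}} \tag{3.28}$$

Solving for C and D, we obtain:

$$C = \frac{1}{2}e^{\kappa\frac{L}{2}}\left[\left(1 - \frac{ik m_2^*}{\kappa m_1^*}\right)e^{ik\frac{L}{2}}F + \left(1 + \frac{ik m_2^*}{\kappa m_1^*}\right)e^{-ik\frac{L}{2}}G\right] \tag{3.29}$$

and

$$D = \frac{1}{2}e^{-\kappa\frac{L}{2}}\left[\left(1 + \frac{ik m_2^*}{\kappa m_1^*}\right)e^{ik\frac{L}{2}}F + \left(1 - \frac{ik m_2^*}{\kappa m_1^*}\right)e^{-ik\frac{L}{2}}G\right]$$

In matrix format:

$$\begin{pmatrix}C\\D\end{pmatrix} = \begin{pmatrix}M_5 & M_6\\M_7 & M_8\end{pmatrix}\begin{pmatrix}F\\G\end{pmatrix} \tag{3.30}$$

Substituting (3.30) in (3.26) gives the relationship between the incident and
transmitted wavefunctions, which we can write in symbolic form:

$$\begin{pmatrix}A\\B\end{pmatrix} = \begin{pmatrix}M_1M_5 + M_2M_7 & M_1M_6 + M_2M_8\\M_3M_5 + M_4M_7 & M_3M_6 + M_4M_8\end{pmatrix}\begin{pmatrix}F\\G\end{pmatrix} \tag{3.31}$$

In (3.31), there is one term that allows the calculation of the transmission
coefficient:

$A = (M_1M_5 + M_2M_7)F + (M_1M_6 + M_2M_8)G = (M_1M_5 + M_2M_7)F$, since G = 0.
Therefore,

$$M_1M_5 + M_2M_7 = e^{ikL}\left[\cosh(\kappa L) - \frac{i}{2}\left(\frac{k m_2^*}{\kappa m_1^*} - \frac{\kappa m_1^*}{k m_2^*}\right)\sinh(\kappa L)\right] \tag{3.32}$$

$$T = \frac{|F|^2}{|A|^2} = \frac{1}{\cosh^2(\kappa L) + \frac{1}{4}\left(\frac{k m_2^*}{\kappa m_1^*} - \frac{\kappa m_1^*}{k m_2^*}\right)^2\sinh^2(\kappa L)} \tag{3.33}$$

Note that (3.32) is easily adapted to the case where $E > V_0$. The variable $\kappa$ then
becomes pure imaginary, $\kappa = -ik$ with k the (now real) region II wavevector defined
above, and the transmission coefficient can be calculated as shown here.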


This approach can be extended in a straightforward way to structures having 
a series of potential barriers with arbitrary material properties, barrier height and 
position. The passage through each interface is treated by a 2 x 2 matrix that 
summarizes the boundary conditions. 
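
The matrix bookkeeping described above can be sketched in Python. This is an illustrative implementation under stated assumptions, not the author's code: each region is a constant potential with its own effective mass, the wavefunction in every region is written as a pair of counter-propagating waves with a (possibly imaginary) wavevector, and the first and last regions are the same material so that T = |F|²/|A|². The function names and the data layout (`regions`, `interfaces`) are hypothetical. Energies exactly equal to a region's potential are not handled, since the wavevector vanishes there.

```python
import cmath

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
Q = 1.602176634e-19      # J per eV


def wavevector(energy_ev, potential_ev, mass_ratio):
    """k = sqrt(2 m* (E - V)) / hbar; equals i*kappa when E < V."""
    return cmath.sqrt(2.0 * mass_ratio * M0 * (energy_ev - potential_ev) * Q) / HBAR


def interface_matrix(k_left, m_left, k_right, m_right, x0):
    """Matrix taking the amplitudes right of x0 to those left of x0.

    It encodes continuity of Psi and of (1/m*) dPsi/dx at the interface.
    """
    rho = (k_right * m_left) / (k_left * m_right)
    right_p = cmath.exp(1j * k_right * x0)
    right_m = cmath.exp(-1j * k_right * x0)
    left_p = cmath.exp(-1j * k_left * x0)
    left_m = cmath.exp(1j * k_left * x0)
    return [[0.5 * left_p * (1 + rho) * right_p, 0.5 * left_p * (1 - rho) * right_m],
            [0.5 * left_m * (1 - rho) * right_p, 0.5 * left_m * (1 + rho) * right_m]]


def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]],
            [a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]]]


def transmission(energy_ev, regions, interfaces):
    """regions: [(V in eV, m*/m0), ...]; interfaces: boundary positions in m."""
    ks = [wavevector(energy_ev, v, m) for v, m in regions]
    total = [[1.0, 0.0], [0.0, 1.0]]
    for i, x0 in enumerate(interfaces):
        step = interface_matrix(ks[i], regions[i][1], ks[i + 1], regions[i + 1][1], x0)
        total = matmul(total, step)
    # With no wave incident from the right (G = 0), A = total[0][0] * F.
    return 1.0 / abs(total[0][0]) ** 2


# Single 10 nm, 0.25 eV barrier, free-electron mass everywhere.
single_regions = [(0.0, 1.0), (0.25, 1.0), (0.0, 1.0)]
single_edges = [-5e-9, 5e-9]
print(transmission(0.2499, single_regions, single_edges))

# Two such barriers separated by a 5 nm gap (hypothetical geometry).
double_regions = [(0.0, 1.0), (0.25, 1.0), (0.0, 1.0), (0.25, 1.0), (0.0, 1.0)]
double_edges = [-12.5e-9, -2.5e-9, 2.5e-9, 12.5e-9]
print(transmission(0.10, double_regions, double_edges))
```

Using complex arithmetic throughout means the same interface matrix serves for energies below and above the barrier; adding more barriers only lengthens the two lists.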

To appreciate the quantum mechanical behavior of the particle, we will plot 
out the transmission coefficient as a function of incident particle energy (see Fig. 
3.7). For the examples that follow, the barrier width is 10 nm and the barrier 
height is 0.25 eV. 


Case I: $m_1^* = m_2^* = 9 \times 10^{-31}$ kg

The first observation is that transmission through the barrier is negligibly
small except where the incident kinetic energy of the particle approaches that of
the barrier potential. Even in this case, the transmission coefficient does not
exceed $6 \times 10^{-3}$. The transmission coefficient for a wider spectrum of energies
both below and above the barrier potential is shown in Fig. 3.8. This result is
analogous to that shown in Fig. 2.9 for photon transmission through a plasma
layer.



Fig. 3.8 Transmission coefficient versus incident particle energy using the same conditions as in Fig. 3.7. 
The transmission coefficient reaches unity for particle energies above the barrier potential, but only at 
discrete energies, showing that the barrier reflects the particle with significant efficiency, even when the 
incident energy of the particle is several times larger than the barrier height 


In the classical case, the transmission coefficient = 0 for energies less than 
the barrier potential energy. The transmission coefficient is 1 for energies greater 
than the barrier potential. The analysis of particle behavior in the quantum 
regime is different. For energies less than the barrier potential, the transmission 
coefficient is indeed small compared to unity, but as we have shown in Fig. 3.7, 
it is not zero, and its value is a function of incident energy. For incident energies 
greater than the barrier height, the envelope of the transmission coefficient tends 
toward unity, while the coefficient itself displays a series of pronounced 
resonances. These resonances occur at energies that correspond to the cases
where an integral number of half de Broglie wavelengths of the particle-wave
inside the barrier adds up to the barrier width (Fig. 3.8).
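
The resonance condition can be made explicit with a short worked step. What follows is a sketch derived here from the single-barrier transmission coefficient, $T = 1/\left[\cosh^2(\kappa L) + \frac{1}{4}\left(\frac{k m_2^*}{\kappa m_1^*} - \frac{\kappa m_1^*}{k m_2^*}\right)^2\sinh^2(\kappa L)\right]$; the symbols $k_2$ and $\beta$ are labels introduced only for this illustration. For $E > V_0$ the decay constant becomes imaginary, $\kappa = -ik_2$ with $k_2 = \frac{1}{\hbar}\sqrt{2m_2^*(E - V_0)}$ real, so that $\cosh(\kappa L) = \cos(k_2L)$ and $\sinh^2(\kappa L) = -\sin^2(k_2L)$. Writing $\beta = \frac{k m_2^*}{k_2 m_1^*}$:

$$T = \frac{1}{\cos^2(k_2 L) + \frac{1}{4}\left(\beta + \frac{1}{\beta}\right)^2\sin^2(k_2 L)} = \frac{1}{1 + \frac{1}{4}\left(\beta - \frac{1}{\beta}\right)^2\sin^2(k_2 L)}$$

The transmission is therefore unity whenever $\sin(k_2L) = 0$:

$$k_2 L = n\pi \quad\Longleftrightarrow\quad L = n\frac{\lambda_2}{2}, \qquad E_n = V_0 + \frac{n^2\pi^2\hbar^2}{2m_2^*L^2}, \quad n = 1, 2, 3, \ldots$$

Between resonances the transmission falls to a minimum of $1/\left[1 + \frac{1}{4}\left(\beta - \frac{1}{\beta}\right)^2\right]$. For equal masses $\beta \to 1$ at high energy, so this envelope tends to unity; for unequal masses $\beta \to \sqrt{m_2^*/m_1^*} \ne 1$, and the envelope stays below unity.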














In this example, $m_1^* = 9 \times 10^{-31}$ kg and $m_2^* = 0.066\,m_0$. All other parameters remain
the same (Fig. 3.9).

Fig. 3.9 Transmission coefficient versus incident particle energy. In this case the effective mass of the
particle in the barrier region is taken to be 0.066 $m_0$. The result is that the barrier effectiveness is significantly
reduced, and the transmission coefficient of the particle is close to unity as its incident energy approaches
the barrier height

In this case we can see that the transmission coefficient has increased by 
nearly two orders of magnitude when the incident energy is close to the barrier 
height. The magnitude of the transmission coefficient is sensitive both to the 
ratio of the effective masses as well as their absolute values. 
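
The two cases can be compared directly by evaluating the closed-form single-barrier result. The sketch below is a minimal illustration that takes $T = 1/\left|\cosh(\kappa L) - \frac{i}{2}\left(\frac{k m_2^*}{\kappa m_1^*} - \frac{\kappa m_1^*}{k m_2^*}\right)\sinh(\kappa L)\right|^2$ as its starting point; the function name and the sample energy just below the barrier height are choices made here. Complex arithmetic is used so that the same expression also holds for energies above the barrier, where $\kappa$ becomes imaginary.

```python
import cmath

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
Q = 1.602176634e-19      # J per eV


def barrier_transmission(energy_ev, v0_ev, width_m, m1_ratio, m2_ratio):
    """Transmission through one rectangular barrier; masses in units of m0."""
    k = cmath.sqrt(2.0 * m1_ratio * M0 * energy_ev * Q) / HBAR
    kappa = cmath.sqrt(2.0 * m2_ratio * M0 * (v0_ev - energy_ev) * Q) / HBAR
    s = (k * m2_ratio) / (kappa * m1_ratio)
    amplitude = (cmath.cosh(kappa * width_m)
                 - 0.5j * (s - 1.0 / s) * cmath.sinh(kappa * width_m))
    return 1.0 / abs(amplitude) ** 2


energy = 0.2499   # eV, just below the 0.25 eV barrier
t_equal = barrier_transmission(energy, 0.25, 10e-9, 1.0, 1.0)      # Case I
t_light = barrier_transmission(energy, 0.25, 10e-9, 1.0, 0.066)    # lighter mass in barrier
print(t_equal, t_light, t_light / t_equal)
```

Near the barrier height the ratio of the two results is of order one hundred, in line with the increase of nearly two orders of magnitude noted above.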

As before, we look at the overall behavior for incident energies both below 
and above the barrier height. 

In Fig. 3.10, the behavior of the transmission coefficient for energies above
the barrier height is quite different from that shown in Fig. 3.8. There are still the
pronounced resonances that occur when the barrier width is an integral
multiple of half the particle wavelength inside the barrier. However, the transmission coefficient no longer
evolves asymptotically toward unity at large energies. The barrier acts like a
strong scattering center at all non-resonant energies. Scattering occurs because of
the discontinuity of the wavevector between region II and the surrounding 
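regions. The difference in effective masses adds to the strength of scattering
through the ratio $\left(\frac{\kappa m_1^*}{k m_2^*}\right)$ that appears (directly or as its reciprocal)
in (3.29), which reflects the boundary condition of continuity of current.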


3.6 Tunneling in pn-Junction Diodes 

Tunneling is an important phenomenon in semiconductor devices like transistors 
and pn-junction diodes. In this section we will develop a model for tunneling in 
reverse-biased pn-junctions. 



Fig. 3.10 Transmission coefficient versus incident particle energy using the same conditions as in Fig. 3.9. 
The transmission coefficient reaches unity for particle energies above the barrier potential, but only at 
discrete energies. However, the transmission coefficient does not approach unity at high energies, as in Fig. 
3.8. Instead, the difference in effective masses inside and outside the barrier results in a significant 
reflection of the particle wavefunction, even in the classical energy regime 


In Fig. 3.11, we show a schematic diagram of a pn-junction, plotting 
potential energy versus distance. In the model that follows, electrons in the 
valence band are confined in a potential well. The barrier height is the bonding 
energy (energy gap). By tunneling into the conduction band, a valence electron 
leaves the bonding state and transitions to the conduction band where it is free to 
carry current. All electrons in the valence band are candidates for tunneling. 
When a pn junction is reverse-biased, the conduction band and valence band can
have the same potential energy, separated by a distance L as shown in the
diagram. At any fixed point in space, the valence band and conduction band are
separated by the bandgap energy. The equivalent electric field is $\mathcal{E} = \frac{E_g}{qL}$. When
tunneling occurs, an electron makes a transition from the valence band, across
the forbidden gap to the conduction band at constant energy. Then it contributes
to the current density in the conduction band according to the definition:

$$J = nqv_{sat} \tag{3.34}$$

where q is the charge on the electron, and $v_{sat}$ is the saturation velocity for
electrons in the conduction band (for common semiconductors, this is typically
$v_{sat} \approx 1 \times 10^7$ cm s$^{-1}$). The electron density in the conduction band is expressed:
$n = N_V T$, where $N_V$ is the electron density in the valence band and T is the
tunneling transmission coefficient.



Fig. 3.11 A schematic energy level diagram for a pn-junction in reverse bias. An electron may tunnel from 
the valence band to the conduction band at constant potential. The tunneling barrier shown in the inset is 
triangular in shape 


Electrons in the conduction band contribute to the reverse leakage current. 
These electrons can gain additional energy from the electric field and initiate 
impact ionization, generating additional carriers, increasing the dark current and 
eventually leading to avalanche breakdown. Tunneling and impact ionization 
each make substantial contributions to the dark current near the reverse 
breakdown voltage. The relative contribution depends principally on the 



















bandgap energy. Impact ionization dominates in materials like silicon and GaAs
which have a bandgap greater than 1 eV. Tunneling dominates in materials with
a lower bandgap, like InAs ($E_g$ = 0.4 eV at room temperature). An intermediate
case, germanium, with a bandgap of 0.6 eV, is an example of a material in which
tunneling and impact ionization make similar contributions to the dark current.

In the treatment which follows we will use the result that we have obtained 
for the tunneling transmission coefficient to characterize the tunneling current. 
This approach requires some approximations. We are not aiming for quantitative 
agreement with experiment, but rather an understanding of how the tunneling 
current varies as a function of applied bias voltage, and how the tunneling 
current changes with regard to parameters such as the fundamental bandgap 
energy and the effective mass of electrons. Some important approximations are 
discussed below. 

(a) Tunneling barrier. 

The tunneling barrier as shown in Fig. 3.11 is triangular, whereas we have 
carried out a calculation for a rectangular barrier. Obviously the triangular 
barrier will allow more current to pass. A more precise calculation can be made 
to take full account of the actual barrier geometry by using the WKB method to 
handle the boundary conditions (Kane 1959). 

(b) Effective mass 

Electrons in the valence band of a semiconductor like Ge or GaAs are 
present in two bands, each of which has its own characteristic effective mass. In 
the conduction band, the electron mass is different still. The effective mass in the 
fundamental gap region cannot be measured directly. This poses a problem for 
how to assign the mass in regions I, II and III of the simple one-dimensional 
model for free-electron tunneling. In our approximation, we choose the same 
mass throughout the structure, while choosing the effective mass for electrons in 
Ge to be greater than that for GaAs. 

The tunneling current is:

$$J_{Tunn} = nqv_{sat} \tag{3.35}$$

where

$n = N_V T$,

$v_{sat} = 1.2 \times 10^7$ cm s$^{-1}$.

This is the saturated electron velocity in electric fields 


greater than 5 x W ' v cm" 1 , 

N v - 4 valence electrons/atom x 8 atoms/unit-cell/unit-cell volume = 1.7 x 

10 23 cm -3 
and 


1 

cosir {*!.) + (|) - 3f) sinh 2 <*L) 

Using parameters appropriate for GaAs. 

The tunneling barrier is shown in the inset of Fig. 3.11. The electrons of 
interest lie at the valence band edge, and have only thermal energy 


$$E = \tfrac{3}{2} k_B T$$ (3.36)


The barrier height $V_0 = E_g = 1.43$ eV. The wavevector $\kappa$ in the forbidden gap is:

$$\kappa = \frac{1}{\hbar}\sqrt{2 m^{*} (V_0 - E)}$$

and $|\kappa| = 1.4 \times 10^{9}$ m$^{-1}$ $\gg |k|$.

The width of the barrier $L$ can be determined from the electric field:
$L = 2.7 \times 10^{-7}$ m for GaAs, with $N_D = 10^{16}$ cm$^{-3}$ and a reverse bias
of $V = -1$ V.

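As a rough numerical check of this barrier width, the sketch below assumes an abrupt one-sided junction whose maximum field is $F = \sqrt{2 q N_D |V| / (\varepsilon_r \varepsilon_0)}$ and takes $L = E_g/(qF)$. The built-in potential is neglected and the relative permittivity of 12.9 is an assumed value for GaAs; neither choice is stated in the text, and the function name is only illustrative.

```python
import math

Q = 1.602176634e-19       # elementary charge (C)
EPS0 = 8.8541878128e-12   # vacuum permittivity (F/m)

def barrier_width(e_gap_ev, n_d_cm3, v_reverse, eps_r):
    """Tunneling distance L = E_g / (q F) for a one-sided abrupt junction.

    Assumptions: F is the maximum depletion field and the built-in
    potential is neglected.
    """
    n_d = n_d_cm3 * 1e6                                   # cm^-3 -> m^-3
    field = math.sqrt(2 * Q * n_d * abs(v_reverse) / (eps_r * EPS0))  # V/m
    return e_gap_ev / field                               # (E_g / q) / F, in metres

L_gaas = barrier_width(1.43, 1e16, -1.0, 12.9)
print(f"L = {L_gaas:.2e} m")
```

Under these assumptions the result is close to the value of $2.7 \times 10^{-7}$ m quoted above.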
To evaluate the transmission coefficient, we set $m_1 = m_2 = m^{*}$ and we note
that $\kappa L \gg 1$. The transmission coefficient can be simplified to:

$$T \approx 16 \left(\frac{k}{\kappa}\right)^{2} e^{-2\kappa L}$$ (3.39)

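$$\kappa L \approx \frac{E_g}{q}\sqrt{\frac{2 m^{*} E_g}{\hbar^{2}}}\sqrt{\frac{\varepsilon \varepsilon_0}{2 q N_D |V|}}$$ (3.40)

and

$$\left(\frac{k}{\kappa}\right)^{2} \approx \frac{3 k_B T}{2 E_g}$$, where $T$ is the temperature (3.41)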

This results in an expression for the tunneling current density in terms of the reverse bias voltage. It
is crucial to note that the applied voltage does not “ionize” or otherwise energize 
electrons in the valence band. The effect of increasing the reverse bias voltage is 
to decrease the physical width between the valence band and conduction band, 
increasing the probability of tunneling. 
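The bias dependence can be illustrated with a short numerical sketch. It assumes $J = q N_V v T$ with the thick-barrier limit $T \approx 16 (k/\kappa)^2 e^{-2\kappa L}$, $(k/\kappa)^2 \approx 3 k_B T / (2 E_g)$, $\kappa \approx \sqrt{2 m^{*} E_g}/\hbar$, and $L = E_g/(qF)$ for a one-sided junction without built-in potential. The effective masses (0.067 and 0.1 in units of $m_0$) and permittivities (12.9 and 16.0) are illustrative values that are not taken from the text, so only the trends are meaningful. The function returns $\log_{10} J$ because the exponential underflows double precision at low bias.

```python
import math

Q = 1.602176634e-19       # C
EPS0 = 8.8541878128e-12   # F/m
HBAR = 1.054571817e-34    # J s
M0 = 9.1093837015e-31     # kg
KB_EV = 8.617333262e-5    # eV/K

def log10_tunnel_current(v_reverse, e_gap_ev, m_rel, eps_r, n_d_cm3,
                         n_v_cm3=1.7e23, v_sat_cm=1.2e7, temp=300.0):
    """log10 of J (A cm^-2) in the thick-barrier limit (assumed model)."""
    field = math.sqrt(2 * Q * n_d_cm3 * 1e6 * abs(v_reverse) / (eps_r * EPS0))
    width = e_gap_ev / field                                  # barrier width (m)
    kappa = math.sqrt(2 * m_rel * M0 * e_gap_ev * Q) / HBAR   # 1/m
    k_ratio_sq = 3 * KB_EV * temp / (2 * e_gap_ev)            # (k/kappa)^2
    prefactor = Q * n_v_cm3 * v_sat_cm * 16 * k_ratio_sq      # A cm^-2
    return math.log10(prefactor) - 2 * kappa * width / math.log(10)

for bias in (-1, -5, -20, -50):
    gaas = log10_tunnel_current(bias, 1.43, 0.067, 12.9, 1e16)
    ge = log10_tunnel_current(bias, 0.66, 0.1, 16.0, 1e16)
    print(f"V = {bias:4d} V   log10 J(GaAs) = {gaas:8.1f}   log10 J(Ge) = {ge:8.1f}")
```

Increasing the reverse bias raises the field, narrows the barrier, and raises $\log_{10} J$; the smaller gap of Ge gives a much larger current at the same bias.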

This model is evaluated for the cases of Ge and GaAs in Fig. 3.12, where a
one-sided pn junction is assumed having a doping level of $1 \times 10^{16}$ cm$^{-3}$. We
indicate by the vertical arrows the measured breakdown voltages for both Ge and
GaAs having this level of doping. The principal result is that the tunneling 
current density in GaAs is many orders of magnitude smaller than in Ge, due to 
the difference in the energy gap, which is the barrier for tunneling. In addition, it 
can be seen that electron tunneling occurs at all levels of applied bias.

Fig. 3.12 Comparison of tunneling current density in Ge and GaAs pn diodes. Near the breakdown voltage
for each diode, the tunneling current density in Ge is more than 20 orders of magnitude larger than in GaAs.
The heavy line at 10 A cm is a level indicative of the reverse breakdown regime

From a practical viewpoint, one can associate a current density 
J R > 10"-’ A cm - w ith the regime of reverse breakdown. Using this assumption, 

the tunneling current in Ge appears to be an important contribution to the
breakdown regime. In the case of GaAs, the calculated tunneling current density
lies well below the breakdown regime when the breakdown voltage is applied. It
follows that tunneling is not a major contributor to reverse breakdown current in
GaAs, while it is a major contributor to the breakdown current in Ge pn diodes.
The carrier concentration $N_D$ can have an important effect on the tunneling
current because it can be varied over several orders of magnitude. In Fig. 3.13 we give an
example of this effect for a Ge p$^{+}$n diode, with two different levels of doping on the
n-side.

Fig. 3.13 The carrier concentration on the lightly-doped side of the pn diode determines the electric field
and therefore the width of the tunneling region. Reducing this carrier concentration lowers the electric field
at a given voltage. In this example for a Ge pn diode, lowering the carrier concentration by 50% reduces the
tunneling current density by one order of magnitude near the breakdown voltage. This dependence is
important because it is the only recourse available to designers for reducing leakage by tunneling in a
device of a specific material composition
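The doping dependence can be made concrete with a small sketch. It assumes the same one-sided junction picture, $F = \sqrt{2 q N_D |V| / (\varepsilon_r \varepsilon_0)}$ and $L = E_g/(qF)$, so that halving $N_D$ widens the barrier by a factor of $\sqrt{2}$. The drop in $\log_{10} J$ is estimated from the factor $e^{-2\kappa L}$ alone. The Ge parameters used here (effective mass 0.1 $m_0$, permittivity 16.0, bias of 20 V) are assumptions for illustration, and the size of the drop is very sensitive to them, so the output should not be read as reproducing Fig. 3.13.

```python
import math

Q = 1.602176634e-19       # C
EPS0 = 8.8541878128e-12   # F/m
HBAR = 1.054571817e-34    # J s
M0 = 9.1093837015e-31     # kg

def tunnel_width(e_gap_ev, n_d_cm3, v_reverse, eps_r):
    """Barrier width L = E_g / (q F) in metres (built-in potential neglected)."""
    field = math.sqrt(2 * Q * n_d_cm3 * 1e6 * abs(v_reverse) / (eps_r * EPS0))
    return e_gap_ev / field

def decades_lost(e_gap_ev, m_rel, eps_r, n_d_high, n_d_low, v_reverse):
    """Drop in log10(J) when the doping is lowered, from exp(-2 kappa L) only."""
    kappa = math.sqrt(2 * m_rel * M0 * e_gap_ev * Q) / HBAR
    widening = (tunnel_width(e_gap_ev, n_d_low, v_reverse, eps_r)
                - tunnel_width(e_gap_ev, n_d_high, v_reverse, eps_r))
    return 2 * kappa * widening / math.log(10)

w_high = tunnel_width(0.66, 1e16, -20.0, 16.0)
w_low = tunnel_width(0.66, 5e15, -20.0, 16.0)
print(f"width ratio = {w_low / w_high:.3f}")
print(f"decades lost = {decades_lost(0.66, 0.1, 16.0, 1e16, 5e15, -20.0):.1f}")
```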

Tunneling is a pure quantum-mechanical effect that accompanies the wave-like
behavior of particles (electrons, photons, protons, and even atoms).

The tunneling probability increases:

- as the kinetic energy of the incident particle approaches the barrier potential,
- as the physical thickness of the barrier decreases, and
- as the effective mass of the tunneling particle decreases.
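These three trends can be checked numerically with the standard transmission coefficient of a rectangular barrier, $T = [\cosh^2(\kappa L) + \tfrac{1}{4}(\kappa/k - k/\kappa)^2 \sinh^2(\kappa L)]^{-1}$, valid for $E < V_0$ with the same effective mass everywhere. The barrier parameters below (0.25 eV, 5 nm, 0.067 $m_0$) are illustrative choices, not values prescribed by the text.

```python
import math

HBAR2_OVER_2M0 = 0.0380998   # hbar^2 / (2 m_0) in eV nm^2

def barrier_transmission(energy, v0, width, m_rel):
    """Exact T for a rectangular barrier, E < V0, one mass throughout.

    energy and v0 in eV, width in nm, m_rel in units of m_0.
    """
    k = math.sqrt(m_rel * energy / HBAR2_OVER_2M0)            # outside (1/nm)
    kappa = math.sqrt(m_rel * (v0 - energy) / HBAR2_OVER_2M0)  # inside (1/nm)
    asym = 0.25 * (kappa / k - k / kappa) ** 2
    return 1.0 / (math.cosh(kappa * width) ** 2
                  + asym * math.sinh(kappa * width) ** 2)

base = barrier_transmission(0.10, 0.25, 5.0, 0.067)
higher_energy = barrier_transmission(0.20, 0.25, 5.0, 0.067)
thinner = barrier_transmission(0.10, 0.25, 3.0, 0.067)
lighter = barrier_transmission(0.10, 0.25, 5.0, 0.030)
print(base, higher_energy, thinner, lighter)
```

Each of the three modified cases gives a larger transmission than the reference case.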

Tunneling is a source of leakage current in pn diodes and thus in transistors, 
when conditions reduce the distance between conduction and valence bands. 
Reducing tunneling current in devices with nanometer dimensions is not 
straightforward. In the case of leakage current in diodes and transistors, we have 
shown that one option is to reduce the free carrier concentration at the junction, 
which serves to widen the tunneling barrier for a given reverse bias voltage. As a 
result, the tunneling current density near breakdown is also reduced, meaning 
that the competing mechanism, impact ionization, may make the major 
contribution to the reverse current. Putting several barriers in series might seem 
like a possible solution, but in the next section, we will see that this is not at all 
the case. 


3.7 Tunneling in the Presence of Multiple Barriers 

Sequential tunneling in the presence of multiple barriers is treated using the 
same approach as that for a single barrier. A one-dimensional model of a two-barrier
environment is shown in Fig. 3.14.

Fig. 3.14 Schematic model for a double potential barrier in one dimension. A boundary condition is
determined for each interface. Note that the positioning of each barrier, its width and magnitude are arbitrary

The objective is to develop an expression for the transmission coefficient. 
Using (3.21)—(3.27), we can immediately write out the solution by imposing the 
boundary conditions at each interface.

It is evident that this procedure can be extended easily to a large number of
barriers of arbitrary height and position.

The transmission coefficient is determined by multiplying these matrix
equations.

In Fig. 3.15, we show the result of this calculation for a double barrier of
height 0.25 eV and a width of 5 nm for each barrier. The distance between
barriers is 10 nm. The particle effective mass in the barrier region is $0.092\,m_0$
and the effective mass outside is $0.066\,m_0$. The results show that transmission by
tunneling is seen at energies well below the barrier height. At specific resonant
energies, the transmission coefficient is unity, indicating that the potential barrier
is entirely transparent at these energies.

Fig. 3.15 Transmission coefficient for a double barrier structure. Unlike the single barrier example,
tunneling with 100% transmission is possible at specific resonant energies well below the barrier potential.
Resonant tunneling is a dramatic manifestation of the wavelike properties of quantum particles like
electrons
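A minimal transfer-matrix sketch for this double barrier is given below. It is not the matrix formulation of (3.21)-(3.27) itself: it propagates the pair $(\psi, \psi'/m)$ through each layer, which assumes continuity of $\psi$ and of $\psi'/m$ at every interface. The function names and the coarse-scan-then-refine search are illustrative choices. Because the resonance positions depend on the matching condition used, the energies found this way need not coincide exactly with those quoted in the text.

```python
import cmath

HBAR2_OVER_2M0 = 0.0380998   # hbar^2 / (2 m_0) in eV nm^2

def layer_matrix(energy, v, m_rel, width):
    """Transfer matrix of one uniform layer acting on (psi, psi'/m)."""
    k = cmath.sqrt(m_rel * (energy - v) / HBAR2_OVER_2M0)  # imaginary in a barrier
    q = k / m_rel
    c, s = cmath.cos(k * width), cmath.sin(k * width)
    return ((c, s / q), (-q * s, c))

def transmission(energy, layers, m_lead=0.066):
    """Transmission through a stack of (V in eV, m/m_0, width in nm) layers."""
    m11, m12, m21, m22 = 1, 0, 0, 1
    for v, m_rel, width in layers:
        (a, b), (c, d) = layer_matrix(energy, v, m_rel, width)
        m11, m12, m21, m22 = (a * m11 + b * m21, a * m12 + b * m22,
                              c * m11 + d * m21, c * m12 + d * m22)
    q = cmath.sqrt(m_lead * energy / HBAR2_OVER_2M0) / m_lead
    t = 2 / (m11 + m22 + 1j * (m21 / q - q * m12))   # identical leads on both sides
    return abs(t) ** 2

def find_resonances(layers, e_min=0.001, e_max=0.2, step=0.001):
    """Locate transmission maxima: coarse scan, then ternary-search refinement."""
    n = int(round((e_max - e_min) / step)) + 1
    grid = [e_min + i * step for i in range(n)]
    values = [transmission(e, layers) for e in grid]
    peaks = []
    for i in range(1, n - 1):
        if values[i - 1] < values[i] > values[i + 1]:
            lo, hi = grid[i - 1], grid[i + 1]
            for _ in range(100):
                a, b = lo + (hi - lo) / 3, hi - (hi - lo) / 3
                if transmission(a, layers) < transmission(b, layers):
                    lo = a
                else:
                    hi = b
            peaks.append(0.5 * (lo + hi))
    return peaks

# Parameters of Fig. 3.15: 0.25 eV, 5 nm barriers, 10 nm spacing.
DOUBLE_BARRIER = [(0.25, 0.092, 5.0), (0.0, 0.066, 10.0), (0.25, 0.092, 5.0)]

for e_res in find_resonances(DOUBLE_BARRIER):
    print(f"resonance near {e_res:.4f} eV, T = {transmission(e_res, DOUBLE_BARRIER):.3f}")
```

The resonances are very narrow, which is why a coarse scan alone would miss the unity peaks and a refinement step is used.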

Resonant tunneling has been studied theoretically since the 1950s. In the 
general absence of computers, the first approximate solution was achieved using 
the WKB method (Duke 1969). The transfer matrix method developed here is 
more accurate, and solutions are easily implemented using MATLAB or similar 
computing environments (see Harrison 2006 and Wartak 2013). Chang, Esaki 
and Tsu first measured resonant tunneling in a GaAs/AlGaAs heterostructure 
having parameters similar to those used for the results of Fig. 3.15 (Chang et al. 
1974). Resonant tunneling is used to enhance performance of transistors and as a 
means of injecting carriers into quantum well structures at energies below the 
barrier height of the well. This type of injection is key to the operation of the 
quantum cascade laser, for example. 

It is instructive to display these results using a logarithmic scale and this is 
shown in Fig. 3.16. There is tunneling transmission at all energies, although the 
transmission coefficient remains far below unity with the exception of the two 
resonant energies at 0.035 and 0.124 eV. Note that these energies are quite close 
to those calculated earlier for the finite potential well having the same width and 
effective mass ratio. This is an important underlying principle of resonant 
tunneling: quantum mechanical particles can tunnel with near unity transmission
coefficient through a multiple barrier structure at specific energies. These
energies correspond to the quantised levels of a finite potential well having the
same parameters of potential depth, width, and effective masses in the well and
the barrier.



Fig. 3.16 Transmission coefficient versus incident energy for a double barrier structure using the same 
parameters as those for Fig. 3.15. Using a log scale reveals that the transmission coefficient increases 
gradually with incident energy, just as in the case of tunneling through a single barrier. However, there are 
in addition specific resonant energies that are closely related to the energies of the confined states in a 
potential well (see Fig. 3.4) 

The transfer matrix method can be extended in a straightforward way to 
calculate tunneling transmission in a heterostructure containing many periods. 
The principal result is that the narrow resonances seen in Figs. 3.15 and 3.16 
broaden into energy bands (Tsu and Esaki 1973). The mathematics of the 
calculation are similar to those used for computing the optical pass bands for 
interference filters, with the changes in refractive index playing an analogous 
role to changes in barrier height used in tunneling. 

In Chap. 2, we examined the wave behavior of photons. In one example we
considered the one-dimensional propagation of photons through a plasma layer
barrier (see Sect. 2.4.1). Photon propagation through a region which is lossy,
because the photon wavevector is imaginary, is nonetheless allowed. The
attenuation of the wavefunction amplitude depends on the energy of the photon
and the thickness of the plasma region. The mathematics that describe
penetration of a photon through such a region are identical to those of an
electron tunneling through a potential barrier discussed above.

In Chap. 2, we also discussed the propagation of photons through a structure 
where the index of refraction is modulated between two or more values. This is a 
different situation, in which the photon wave vector is always a real quantity. 
Such structures, whether periodic or not, give rise to interference, and the 
transmission coefficient shows regions of transmission and reflection. The 
exercises of Chap. 2 show that structures having an extended period variation of 
index of refraction lead to frequency bands of high transmission, separated by 
regions of high reflection. These results are supported by mathematical analysis 
that is similar, but not identical to the analysis that has produced the results 
shown in Fig. 3.17. The feature that is responsible for this similarity is the shared 
wave nature of photons and matter, proposed by Louis de Broglie and
demonstrated by Clint Davisson and Lester Germer.



Fig. 3.17 This series of images shows the effect of augmenting the number of barriers in a resonant
tunneling structure.

In each figure we show the transmission coefficient on the vertical axis as a function of energy (eV).
Each calculation uses the following parameters. Barrier potential = 0.25 eV, Barrier width = 5 nm, Effective
mass in the barrier = $0.092\,m_0$, Effective mass outside the barrier = $0.066\,m_0$, Space between the barriers =
10 nm. a 2 barriers, b 5 barriers, c 20 barriers, d 50 barriers.

The principal result is that adding more barriers does not impede the transmission by tunneling. Quite 
the reverse is true. Augmenting the number of barriers enlarges the energy region where transmission may 
occur with high efficiency. 

The calculations show that the resonant transmission window does not change its energy as the number 
of periods is augmented. 

Instead, the resonant transmission region widens to form an energy band. The energy width of each 
band is already well-defined after 5 barriers. Adding additional barriers increases the density of resonant 
states, without changing the width of the energy band 
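The growth in the number of resonant states can be explored with the same kind of transfer-matrix sketch, extended to $N$ identical barriers. As before, the matching of $\psi$ and $\psi'/m$ at each interface is an assumption of this sketch, and the helper names are hypothetical. For $N$ barriers there are $N - 1$ coupled wells, so one expects $N - 1$ transmission maxima within the lowest band.

```python
import cmath

HBAR2_OVER_2M0 = 0.0380998   # hbar^2 / (2 m_0) in eV nm^2

def layer_matrix(energy, v, m_rel, width):
    """Transfer matrix of one uniform layer acting on (psi, psi'/m)."""
    k = cmath.sqrt(m_rel * (energy - v) / HBAR2_OVER_2M0)
    q = k / m_rel
    c, s = cmath.cos(k * width), cmath.sin(k * width)
    return ((c, s / q), (-q * s, c))

def transmission(energy, layers, m_lead=0.066):
    """Transmission through a stack of (V in eV, m/m_0, width in nm) layers."""
    m11, m12, m21, m22 = 1, 0, 0, 1
    for v, m_rel, width in layers:
        (a, b), (c, d) = layer_matrix(energy, v, m_rel, width)
        m11, m12, m21, m22 = (a * m11 + b * m21, a * m12 + b * m22,
                              c * m11 + d * m21, c * m12 + d * m22)
    q = cmath.sqrt(m_lead * energy / HBAR2_OVER_2M0) / m_lead
    return abs(2 / (m11 + m22 + 1j * (m21 / q - q * m12))) ** 2

def superlattice(n_barriers, v0=0.25, barrier=5.0, well=10.0,
                 m_barrier=0.092, m_well=0.066):
    """Layer list for n_barriers identical barriers separated by wells."""
    layers = []
    for i in range(n_barriers):
        layers.append((v0, m_barrier, barrier))
        if i < n_barriers - 1:
            layers.append((0.0, m_well, well))
    return layers

def count_resonances(layers, e_min, e_max, step):
    """Count local maxima of T(E) on a uniform energy grid."""
    n = int(round((e_max - e_min) / step)) + 1
    values = [transmission(e_min + i * step, layers) for i in range(n)]
    return sum(1 for i in range(1, n - 1)
               if values[i - 1] < values[i] > values[i + 1])

# Energy window chosen (as an assumption) to bracket only the lowest band.
for n in (2, 3, 5):
    print(n, "barriers:", count_resonances(superlattice(n), 0.01, 0.06, 2e-5), "resonances")
```

The grid step must be much smaller than the spacing between neighbouring resonances for the count to be reliable; the value used here is a choice made for these parameters.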



Fig. 3.18 The tunneling coefficient in a superlattice structure gives rise to energy minibands and minigaps.
The minibands are usually referred to as subbands. This additional energy level structure, created by the
geometry of the sample, can be exploited to make both photon detectors and photon emitters, using inter-subband
electronic transitions


It is clear that the resonant tunneling calculation can be used to simulate 
electron energy levels in a one-dimensional periodic structure, i.e., a crystal. At 
the same time, the transfer matrix method can also be used to simulate electron 
energy levels in a non-periodic structure, with varying barrier heights and non-periodic
position. Calculations based on these structures also show that energy
bands develop, although the resonances are not so perfect as in the case of the 
periodic structure. 

Next, it is instructive to turn the transmission coefficient shown in Fig. 3.17
on its side. The transmission coefficient characterises the superlattice structure
over its extent in real space. In Fig. 3.18, we can see immediately that there are
bands of energy where the transmission coefficient is close to unity. We call
these minibands. In between, there are regions where the transmission
coefficient is close to zero, and we call these minigaps.


3.8 Tunneling Time 

Tunneling is a fundamental manifestation of the Heisenberg uncertainty
principle and quantum mechanics. In the previous sections we have studied the
stable states of particle-waves in environments of varying potential energies. In
each case we assign the particle a single well-defined wave-vector and proceed
to evaluate the time-independent Schrödinger equation. Since a measurement
must find the particle either on one side or the other, there is a finite probability
of finding that the particle has tunneled through the barrier. We can examine the
steady-state tunneling probabilities based on the solution of the time-independent
Schrödinger equation in a straightforward way.
However, this approach leaves a key question unanswered: how long does it take
the particle to tunnel from one side to the other?

3.8.1 Wavepackets 

So far we have treated the wavefunction as a free-particle wave, with a single k-vector.
This is convenient mathematically for solving the Schrödinger
equation. However, the Heisenberg uncertainty principle requires that a particle
characterised by a single k-vector (i.e. $\Delta k = 0$) is delocalized spatially ($\Delta x \to \infty$).

The “transit-time” of such a wavefunction does not make physical sense.

As a first step toward understanding the tunneling transit-time, we modify 
the simple free-particle wavefunction into a spatial wavepacket consisting of a 
sum of such wavefunctions weighted by an appropriate envelope function. 



(3.46) 


The envelope function is typically chosen to be a probability distribution
such as a Poisson function or Gaussian function. This permits the normalisation
of $\psi(x)$, assuring that the probability of finding the particle somewhere in a
defined spatial range is unity.
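The construction of a wavepacket from plane waves can be sketched numerically. The example below assumes a Gaussian envelope $A(k) \propto \exp[-(k - k_0)^2 / (4\sigma_k^2)]$, sums the plane waves $e^{ikx}$ on a discrete k-grid, and normalises the result on a finite x-grid. The grid sizes, the central wavevector, and the width $\sigma_k$ are arbitrary illustrative choices. With widths defined as standard deviations of $|\psi|^2$ and $|A|^2$, a Gaussian packet gives $\sigma_x \sigma_k = 1/2$; the text may use a different convention for the widths.

```python
import cmath
import math

def gaussian_wavepacket(x_values, k0=1.0, sigma_k=0.1, n_k=241):
    """psi(x) as a weighted sum of plane waves exp(i k x), normalised on x_values.

    Lengths in nm, wavevectors in 1/nm. The k-grid spans k0 +/- 6 sigma_k.
    """
    dk = 12 * sigma_k / (n_k - 1)
    ks = [k0 - 6 * sigma_k + j * dk for j in range(n_k)]
    weights = [math.exp(-((k - k0) ** 2) / (4 * sigma_k ** 2)) for k in ks]
    psi = [sum(w * cmath.exp(1j * k * x) for w, k in zip(weights, ks))
           for x in x_values]
    dx = x_values[1] - x_values[0]
    norm = math.sqrt(sum(abs(p) ** 2 for p in psi) * dx)
    return [p / norm for p in psi]

xs = [-50 + 0.5 * i for i in range(201)]          # nm
psi = gaussian_wavepacket(xs)
dx = xs[1] - xs[0]
total = sum(abs(p) ** 2 for p in psi) * dx         # should be 1
mean_x = sum(x * abs(p) ** 2 for x, p in zip(xs, psi)) * dx
sigma_x = math.sqrt(sum((x - mean_x) ** 2 * abs(p) ** 2
                        for x, p in zip(xs, psi)) * dx)
print(f"norm = {total:.6f}, sigma_x * sigma_k = {sigma_x * 0.1:.3f}")
```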

A wavepacket is distributed in space, shown by the diagram in Fig. 3.19a. It
must also be distributed in k-space, according to the Heisenberg uncertainty
principle $\Delta x\,\Delta p \geq \hbar$, as diagrammed in Fig. 3.19b. The group velocity of each of
the waves making up the wavefunction can be calculated from the relationship:

$$v_g = \frac{d\omega}{dk} = \frac{\hbar k}{m}$$

which implies that each component of the wavepacket has a group velocity
that depends on the magnitude of its k-vector. If the wavepacket has minimum
uncertainty at time $t = t_0$, that is, $\Delta x\,\Delta p = \hbar$, then at some later time, in the
absence of any external interaction, the spatial extent of the wavepacket will
have spread out, while the distribution in k-space remains constant (conserving
total momentum), and thus $\Delta x\,\Delta p > \hbar$ (Fig. 3.19c). An important result to retain is
that higher k-vector components of the wavepacket have a larger group velocity
than lower k-vector components.

Fig. 3.19 a An example wavepacket with a Gaussian spatial distribution, $\Delta x\,\Delta(\hbar k) \geq \hbar$. b The momentum
($p = \hbar k$) distribution of the wavepacket in a. c Larger wavevectors have higher group velocity. This
causes the wavepacket to spread out spatially, even though the wavevector distribution remains constant
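The spreading can be quantified for the special case of a free Gaussian wavepacket, for which the standard result is $\sigma_x(t) = \sigma_0 \sqrt{1 + (\hbar t / 2 m \sigma_0^2)^2}$ when the packet has minimum uncertainty at $t = 0$. The sketch below assumes this Gaussian form and a free-particle group velocity $\hbar k / m$; the initial width of 5 nm and the free-electron mass are illustrative values only.

```python
import math

HBAR = 1.054571817e-34    # J s
M0 = 9.1093837015e-31     # kg

def group_velocity(k, m_rel=1.0):
    """Free-particle group velocity hbar k / m (m/s), k in 1/m."""
    return HBAR * k / (m_rel * M0)

def packet_width(t, sigma0, m_rel=1.0):
    """Width sigma_x(t) of a free Gaussian packet that is minimal at t = 0."""
    spread = HBAR * t / (2 * m_rel * M0 * sigma0 ** 2)
    return sigma0 * math.sqrt(1 + spread ** 2)

sigma0 = 5e-9   # initial width (m), an assumed value
for t in (0.0, 1e-13, 1e-12):
    print(f"t = {t:.0e} s   sigma_x = {packet_width(t, sigma0) * 1e9:.1f} nm")
```

Components with larger $k$ move faster, which is the origin of the spreading and of the ordering of wavevectors along the packet.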


3.8.2 Detection of a Wavepacket Representing a Single Particle

A single quantum particle such as an electron can be detected. In order to 
measure the transit time of an electron that may tunnel through a barrier, the 
electron wavefunction must be localized spatially. This means that the electron 
wavefunction must be a wavepacket composed of a distribution of k-vectors. To 
measure the time of flight of the electron, we refer to the schematic diagram of a 
possible experiment shown in Fig. 3.20. We have considerably simplified the
details so that the one-dimensional electron wavepacket is launched to the
right toward an electron detector placed at a specific distance from the emission
shutter. At time $t = 0$, a gate is opened allowing the electron wavepacket to
propagate to the right. What would the observer of the detector record?



Fig. 3.20 Schematic diagram of an experiment to measure electron transit time in the presence and the 
absence of a tunneling barrier 


The detector is capable of detecting an electron and causing a charge to 
circulate in the detector electronics resulting in a signal. When the detector 
captures an electron, the spatial extent of the wavepacket collapses to the 
location of the detector. The probability of finding the electron at the detector is 
unity, and zero everywhere else. The signal amplitude at the detector is 1 when 
an electron is detected; otherwise it is zero. 

If we perform many such transmission experiments and superimpose the
results, we see that the detected arrival times are spread out, with the
greatest frequency centered around the classical expected value $\langle t \rangle$.

In the second group of experiments, we put a single potential barrier in the
path of the electron wavepacket. The wavepacket is launched with a kinetic
energy $E = \hbar^{2}k^{2}/2m$ that is less than the barrier height. From the investigation of
tunneling through a single barrier, we know that the transmission coefficient is
less than unity, and in most cases far less than unity. The transmission coefficient
increases exponentially as a function of the wavevector. In the wavepacket, the
components with larger wavevector are those with higher group velocity, and they
are distributed at the leading edge of the wavepacket.

During the impact of the wavepacket on the barrier, the barrier acts as a 
passive filter, enabling transmission of the larger wavevector components and 
reflecting the smaller wavevector components. Thus the collision reshapes the 
wavepacket amplitude from its original form into a new profile. Focussing for a 
moment on the tunneling phenomenon, the highest wavevector components of 
the wavepacket are clustered in the leading edge of the wavepacket. The 
probability that some of these components of the wavepacket will be spatially 
localized on the right hand side of the barrier will depend on both the pre-collision
probability density of these elements and their corresponding
transmission coefficients. Following the leading edge are wavepacket
components with lower values of wavevector. While the probability density of
these components is higher, their transmission coefficients are orders of
magnitude lower. The result is that a small peak is formed in the wavepacket on
the right-hand side of the tunneling barrier. This peak may be fully formed
before the principal maximum of the wavepacket is incident on the barrier.

Such a collision is shown schematically in Fig. 3.21. At this point it is important
to remember that the entire wavepacket, on both sides of the barrier, represents a
single and indivisible electron.



Fig. 3.21 When a particle represented by a wavepacket is incident on a barrier, the wavepacket is
dramatically reshaped into two different spatial regions on either side of the barrier. On the left-hand side, the
incident portion of the wavepacket interferes with the reflected portion. On the right-hand side a smaller
peak represents the part of the wavepacket that has tunnelled through the barrier. When a measurement
takes place, the particle is localized either on one side of the barrier or on the other

The observer running the experiment at the detector has a chart recorder that 
marks the detection of an electron as a function of time. In the discussion that 
follows, we will assume that the observer records the results of many 
experiments on the same sheet of paper, thus accumulating results. Under the 
conditions of the experiment, the expected arrival time of the peak of the free-electron
wavepacket occurs at $\langle t \rangle = 200\tau$, where $\tau$ is a unit of time. In Fig. 3.22

we show the results of an experiment in which a free-electron wavepacket is 
launched along an unimpeded pathway toward the detector. In experiment 1, 
shown in Fig. 3.22a, the electron is detected at 1 *= 19 i- 57 ", slightly ahead of the 

expected arrival time. The following experiments show results following 6, 13, 
21, and 50 repetitions. It is apparent that the measured arrival times are clustered
around the expected arrival time $\langle t \rangle = 200\tau$. The electrons arrive one at a time,

and for each event, the signal amplitude is unity at the time of detection, and 
zero elsewhere. Each such detection represents the transmission of information 
in the full sense of the theory of special relativity. If the electrons are launched at 
equal time intervals, then the density of points in Fig. 3.22e is proportional to the 
detection rate of electrons at the detector.

Fig. 3.22 a-e A series of measurements to detect the arrival time of an electron wavepacket emitted at a
constant rate from an electron source with a well-defined group velocity. Detection is a statistical event
based on the quantum mechanical behavior of electrons
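The accumulation of single detection events can be imitated with a small Monte Carlo sketch. It assumes that the arrival-time distribution is Gaussian, centered on $200\tau$ with a spread of $10\tau$; the spread and the random seed are arbitrary choices made for illustration and are not taken from the text. Each simulated experiment yields exactly one arrival time, and only the accumulated record reveals the shape of the wavepacket.

```python
import random
import statistics

def simulate_arrivals(n, mean_tau=200.0, spread_tau=10.0, seed=1):
    """Draw n single-electron arrival times (units of tau) from an assumed Gaussian."""
    rng = random.Random(seed)
    return [rng.gauss(mean_tau, spread_tau) for _ in range(n)]

# Same repetition counts as the panels of Fig. 3.22.
for n in (1, 6, 13, 21, 50):
    times = simulate_arrivals(n)
    print(f"{n:3d} detections, mean arrival time = {statistics.mean(times):.1f} tau")
```

With few repetitions the mean can lie well away from $200\tau$; it settles toward the expected value as events accumulate.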


By superimposing the free-electron wavepacket on the results in Fig. 3.22e, 
it can be seen in Fig. 3.23 that there is a one-to-one correspondence between the 
magnitude of the wavepacket and the detection rate of electrons at the detector.

Fig. 3.23 The inset shows schematically a calibration of the magnitude of the wavepacket in terms of the
arrival rate of electrons at the detector


When a tunneling barrier is placed in the propagation path, we can see 
whether the barrier retards or advances the arrival time. Having measured a 
given detection rate at a particular transit time for electrons that have tunneled 
through a barrier, one can then ask what transit time corresponds to the same 
detection rate for an unimpeded wavepacket. Is it shorter or longer? This 
procedure enables a comparison of the relative arrival times of electrons at a 
detector. A hypothetical experimental procedure might include the following 
steps. 


(i) Fix the position of the detector. 

(ii) Determine the detection rate in the absence of a signal. Suppose that this 
rate is 1 count per second. 

(iii) Choose a particle detection rate higher than the background. Suppose that 
a rate of 10 counts per second is chosen. 

(iv) Launch a stream of electrons, one by one, toward the detector. Choose a
launch rate much larger than the detection rate. Suppose that a rate of $10^{6}$
launches per second is chosen.


(v) Measure the time required for the detection rate to reach 10 counts per 
second. 

(vi) Place a tunneling barrier in the path of the electron stream and repeat step 
v. 

(vii) Compare the times in steps v and vi. 

The paper by Eric Fossum (Fossum 2013) presents an example of an 
analogous method used to detect and analyze single photons in state-of-the-art 
quantum image sensors. 

3.8.3 Relative Time of Flight 

Having established the correspondence between the magnitude of the electron 
wavepacket and the arrival rate of electrons at the detector, we can compare the 
time required to detect an electron in the two cases of an electron transit with no 
barrier in the path and electron transit through a single, abrupt tunneling barrier. 
Consider first the case of tunneling through a single barrier. 

First we need to choose an arrival rate $r_0$ that is well above the background
arrival rate. Next, the rate must be low enough to detect the small peak that is
formed in the forward part of the wavepacket by tunneling through the barrier.
The measurement is focused on the leading edge of the wavepacket. No matter
how much the tunneling event reshapes the wavepacket, the leading edge will
always appear the same, with the arrival rate increasing monotonically.

In the measurement we measure the time required for the arrival rate to
reach the level $r_0$. In the example which follows, we will assume that $r_0 = 10$
counts/s. The propagation properties of the unimpeded wavepacket are already
known before the measurement. The peak of the wavefunction moves with the
group velocity imposed at launch. We can plot the wavepacket as a function of
distance at the time determined for the detection of tunneling electrons. The two
wavepackets are compared in Fig. 3.24. It can be seen that the magnitude of the
wavepacket that represents the unimpeded electron is greater than that of the
tunneling electron everywhere along the leading edge of the wavepacket. This
follows from the result that the tunneling barrier attenuates the transmitted
wavepacket when the incident kinetic energy is less than the barrier height. As a
result, the reflection coefficient must be greater than zero in order to preserve the
normalisation of the wavepacket to unity (that is, the total area under the square
of the wavepacket must be 1). In fact the wavepacket is mostly reflected.
Precisely what is attenuated is the arrival rate. The arrival rate of the unimpeded
electron reaches the level of 10 counts/s in less time. The result of the above
measurement is that passage through a barrier by tunneling retards the arrival of
the electron at the detector.



Fig. 3.24 Comparison of transmission of an electron wavepacket in the absence (shown in the upper curve)
and in the presence of a tunneling barrier (shown in the lower curve). a At time $t = t_1$, the count rate of
the upper, unimpeded wavepacket is sufficient to trigger the detector, while the tunneling wavepacket has
not yet reached a sufficient rate. b At time $t = t_2$, the count rates of both wavepackets are sufficient to
trigger the detector. Of course, the reduction in the transmission coefficient due to tunneling is exactly
compensated by the magnitude of the reflected wavepacket moving to the left

In Fig. 3.24a we show a “snapshot” of two experiments taken at time $t = t_1$.
Each experiment consists of measuring the arrival rate of electrons at the
detector which is fixed in position. The electron source shown schematically in
Fig. 3.20 emits single electron wave packets at the rate of $10^{6}$ electrons/s, each
with the same fixed velocity $v = v_0$. In the upper trace, the unimpeded
wavepacket arrives at the detector. As time increases, the arrival rate increases
until the expectation arrival time is reached, $t = T_{p1}$, and then the arrival rate
decreases. After recording the passage of many identical wavepackets, we have a
result similar to that shown in Fig. 3.22e, and we can determine the time when
the arrival rate first reaches 10 counts/s. This time occurs at $t = t_1$, as shown in
Fig. 3.24a. From Fig. 3.23 we know that this time corresponds to the arrival of
the leading edge of the wavepacket.

Next, we repeat the same experiment, with a tunneling barrier in the path of 
the electron. Transmission through the tunneling barrier creates a small local 
peak in the wavepacket traveling toward the right, and another local peak 
representing the reflected part of the wavepacket that travels toward the left. 
Because the wavepacket magnitude is attenuated, the arrival rate of electrons at 
the detector will also be reduced. As a result, we can see in Fig. 3.24a that at
time $t = t_1$, the arrival rate for the tunneling electrons has not yet reached 10
counts/s.

In Fig. 3.24b we show a “snapshot” of the same two experiments, taken at a
later time, $t = t_2$, when the arrival rate of the tunneling electrons reaches 10
counts/s. During this experiment, we measure the electrons as they reach the
detector. As time increases, the arrival rate increases until a peak is reached at
time $t = T_{p2}$, after which the arrival rate decreases.

When a particle such as an electron is incident on a potential barrier, only
two outcomes are possible. The electron is either transmitted or reflected. The
quantum mechanical wavepacket of unity magnitude is distributed spatially to
reflect the different probabilities of transmission and reflection. The wavepacket
of the unimpeded particle also has unity magnitude, but because there is no
reflected wave, the magnitude is entirely concentrated in the transmitted
wavepacket. At a fixed position, and at any time, the magnitude of the
unimpeded wavepacket will always be greater than the magnitude of the
tunneling wavepacket. This means that the measured arrival rate of
particles represented by the unimpeded wavepacket will also be larger.

The results of the experiment are:

1. By comparing the times when the arrival rates of the two cases are the same,
we conclude that the unimpeded wavepacket is detected first, at
time $t = t_1$, while the tunneling wavepacket is detected at a later
time, $t = t_2$. From the above argument, $t_2 > t_1$ in all cases.

2. We note that the local maximum in the arrival rate of the tunneling
wavepacket occurs prior to the peak in the arrival rate of the unimpeded
wavepacket. That is: $T_{p2} < T_{p1}$.

However, it does not make physical sense to compare these two times. The
time $T_{p1}$ is the expectation arrival time of the unimpeded wavepacket. On the
other hand, the expectation arrival time of the tunneling wavepacket at the
detector is infinitely long, because most of the wavepacket magnitude is moving
toward the left as a result of reflection.

3. While it is straightforward to determine the relative times of arrival, such an
experiment does not provide a satisfactory method for defining or
determining the time of tunneling. To illustrate this point in the example
above, suppose that we required an arrival rate of 100 counts/s as the
criterion for detecting the arrival of the leading edge of a wavepacket. We
could proceed as before for the measurement of the arrival time for the
unimpeded wavepacket. However, for the case of the tunneling wavepacket,
an arrival rate of 100 counts/s would probably never be reached, and one
result of the experiment would be to conclude that no detectable tunneling
occurred.
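The dependence of the measured arrival time on the chosen threshold can be illustrated with a deliberately simple model. The sketch assumes that the arrival rate at the detector follows a Gaussian profile in time, $r(t) = r_{peak} \exp[-(t - t_p)^2 / 2\sigma^2]$, and that tunneling only scales $r_{peak}$ down. The peak rates (1000 and 50 counts/s), the common peak time of $200\tau$, and the width of $10\tau$ are hypothetical values; in particular, the reshaping of the wavepacket by the barrier is ignored.

```python
import math

def threshold_time(r_peak, r0, t_peak, sigma):
    """First time at which a Gaussian arrival-rate profile reaches r0.

    Returns None when the profile never reaches the threshold.
    """
    if r_peak < r0:
        return None
    return t_peak - sigma * math.sqrt(2 * math.log(r_peak / r0))

t1 = threshold_time(1000.0, 10.0, 200.0, 10.0)     # unimpeded wavepacket
t2 = threshold_time(50.0, 10.0, 200.0, 10.0)       # attenuated by tunneling
never = threshold_time(50.0, 100.0, 200.0, 10.0)   # threshold of 100 counts/s
print(t1, t2, never)
```

In this model the attenuated profile always crosses a given threshold later than the unimpeded one, and a threshold above its peak rate is never reached, which mirrors results 1 and 3 above.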

In the next section, we will consider tunneling transit time. We first look at 
tunneling through a single barrier to establish some basic ideas. Next in the case 
of double-barrier tunneling structures, three important effects occur: 
transmission, reflection and resonant storage of the wavepacket in the structure. 

3.8.4 Analysis of the Wavepacket: Phase-Delay Tunneling Time

Analytic computation of tunneling time remains an unsolved problem in
quantum mechanics. An excellent review of this important problem has been
given by Hauge and Støvneng (1989). Those wishing to pursue this topic are
referred to this review as a good foundation.

In Fig. 3.25 we show a simple rectangular one-dimensional potential barrier.

Fig. 3.25 One-dimensional rectangular potential barrier of magnitude $V_0$ and finite width


The particle wavefunction is represented by a wavepacket, the peak of which
can be characterized by a group velocity. The wavepacket is incident on the
barrier and scatters into a superposition of transmitted and reflected components.
We emphasize that the physical reality of the tunneling particle is represented by
one wavefunction that incorporates these two correlated components. After some
time, the wavepacket intensity in the barrier tends to zero, and the wavepacket of
the particle is separated by the barrier into transmitted and reflected components.
This is referred to as the asymptotic regime. The transmitted component can be
expressed in the form:

$\Psi_T(x, t) = \ldots$ (3.49)

In this situation we can apply the stationary phase approximation.
Setting the position of the wavepacket peak equal to the barrier width, we can solve for a transit time $\tau$
associated with the propagation of the wavepacket through the barrier.
The first term on the RHS is the thickness of the barrier divided by the group
velocity of the wavepacket. The second term is the phase time and depends on
how the wavepacket is “reshaped” by the barrier. The phase-derivative term has units of length
and can be thought of as the extra “distance” the particle has to travel in crossing
the barrier. It can be positive, corresponding to a time delay, or negative,
corresponding to an “acceleration” of the wavepacket by passage through the
barrier.
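
One common way to write the stationary-phase argument is sketched below. This is a sketch under stated assumptions and not necessarily the notation of the text: we assume a barrier occupying $0 < x < d$, an incident spectrum $A(k)$ centered on $k_0$, a transmission amplitude written as $|T(k)|e^{i\varphi(k)}$, and the free-particle dispersion $E = \hbar^2 k^2 / 2m$. The symbols $d$, $A(k)$, $\varphi(k)$ and $v_g$ are introduced here for illustration.

```latex
\begin{align}
\Psi_T(x,t) &= \int A(k)\,|T(k)|\,
   \exp\!\Big\{ i\big[\,kx + \varphi(k) - E(k)\,t/\hbar\,\big]\Big\}\,dk \\[4pt]
\frac{\partial}{\partial k}\Big[kx + \varphi(k) - E(k)\,t/\hbar\Big]_{k=k_0} = 0
   \;&\Rightarrow\; x + \frac{d\varphi}{dk} - v_g\,t = 0,
   \qquad v_g = \frac{1}{\hbar}\frac{dE}{dk}\bigg|_{k_0} = \frac{\hbar k_0}{m} \\[4pt]
x = d \;&\Rightarrow\; \tau = \frac{1}{v_g}\left(d + \frac{d\varphi}{dk}\right)
\end{align}
```

In this form the first term is the barrier thickness divided by the group velocity, and the second term carries the energy dependence of the transmission phase, in line with the description above.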

The phase-time model can be applied to a monochromatic electron wave
with a single k-vector at any point in space. In a real experiment involving
electron passage through a resonant tunneling barrier, it is quite evident that the 
electron wavefunction has a finite width in momentum-space and in real-space, 
as required by the uncertainty principle. This approach confirms, however, that 
different k-components of the wavepacket will tunnel at different times resulting 
in a reshaping of the wavepacket. The large-k, high-energy components, at the 
leading edge of the wavepacket penetrate the barrier first. 

In the discussion that follows, we will use a double barrier structure, with 
each barrier having a width of 0.5 nm and a potential of 2 eV. The separation 
between barriers is 1 nm. The effective mass of the electrons is unity. The phase 
method is used to calculate a transit time for each wavefunction in the 
wavepacket. We consider two important cases: tunneling of a wavepacket having 
a k-vector distribution centered on resonance, where the transmission probability 
is a maximum and tunneling of a wavepacket having a k-vector distribution 
centered in between two resonant levels with a much lower transmission 
probability. 
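
The calculation described above can be sketched numerically. The following is a minimal transfer-matrix sketch, assuming the free-electron mass, zero potential outside the barriers and in the well, and a phase time evaluated as $\hbar\, d(\varphi + kL)/dE$ by a central difference. The function names (`amplitude`, `phase_time`, `find_peak`) and the energy window used to locate the lowest resonance are choices made for this illustration and are not taken from the text.

```python
import cmath
import math

HBAR = 1.054571817e-34  # J s
M0 = 9.1093837e-31      # kg (effective mass taken as unity)
EV = 1.602176634e-19    # J per eV

def interface(k1, k2, x):
    """Matrix taking plane-wave amplitudes just right of x to those just left of x."""
    r = k2 / k1
    return [[0.5 * (1 + r) * cmath.exp(1j * (k2 - k1) * x),
             0.5 * (1 - r) * cmath.exp(-1j * (k1 + k2) * x)],
            [0.5 * (1 - r) * cmath.exp(1j * (k1 + k2) * x),
             0.5 * (1 + r) * cmath.exp(-1j * (k2 - k1) * x)]]

def matmul(a, b):
    return [[a[i][0] * b[0][j] + a[i][1] * b[1][j] for j in range(2)]
            for i in range(2)]

def amplitude(e_ev, layers):
    """Complex transmission amplitude at energy e_ev (eV).

    layers: list of (potential in eV, width in m) between two V = 0 regions.
    """
    def k(v_ev):
        return cmath.sqrt(2 * M0 * (e_ev - v_ev) * EV) / HBAR
    pots = [0.0] + [v for v, _ in layers] + [0.0]
    m, x = [[1, 0], [0, 1]], 0.0
    for j in range(len(pots) - 1):
        m = matmul(m, interface(k(pots[j]), k(pots[j + 1]), x))
        if j < len(layers):
            x += layers[j][1]
    return 1 / m[0][0]

def transmission(e_ev, layers):
    return abs(amplitude(e_ev, layers)) ** 2

def phase_time(e_ev, layers, de=1e-6):
    """hbar * d(phi + k*L)/dE by central difference, L = total structure width."""
    length = sum(w for _, w in layers)
    def full(e):
        k = math.sqrt(2 * M0 * e * EV) / HBAR
        return amplitude(e, layers) * cmath.exp(1j * k * length)
    dphi = cmath.phase(full(e_ev + de) / full(e_ev - de))
    return HBAR * dphi / (2 * de * EV)

def find_peak(layers, lo, hi, n=800):
    """Coarse scan for the largest transmission, refined by ternary search."""
    es = [lo + (hi - lo) * i / n for i in range(n + 1)]
    ts = [transmission(e, layers) for e in es]
    i = max(range(1, n), key=lambda j: ts[j])
    a, b = es[i - 1], es[i + 1]
    for _ in range(60):
        m1, m2 = a + (b - a) / 3, b - (b - a) / 3
        if transmission(m1, layers) < transmission(m2, layers):
            a = m1
        else:
            b = m2
    return 0.5 * (a + b)

# Structure from the text: two 0.5 nm, 2 eV barriers separated by a 1 nm well.
double_barrier = [(2.0, 0.5e-9), (0.0, 1.0e-9), (2.0, 0.5e-9)]
e_res = find_peak(double_barrier, 0.1, 0.5)   # assumed search window in eV
e_off = 0.5                                   # an energy between resonances
classical = 2.0e-9 / math.sqrt(2 * e_res * EV / M0)
print("resonance energy (eV):", e_res)
print("T on / off resonance:", transmission(e_res, double_barrier),
      transmission(e_off, double_barrier))
print("phase time on resonance (s):", phase_time(e_res, double_barrier))
print("classical transit time (s):", classical)
```

The interface matrix has the same form as the subroutine in the appendix. The coarse scan followed by a refinement is used because the lowest resonance is much narrower than any practical uniform energy step.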

The tunneling transmission coefficient is displayed in Fig. 3.26a and the
“phase tunneling time” for a free electron corresponding to the phase model in
Fig. 3.26b. There is a one-to-one relationship between the peaks in the
transmission coefficient and the peaks in the phase tunneling time. The most
tightly bound resonances have the longest tunneling times. In Fig. 3.26b we also
show the time required for an unimpeded electron to traverse the same
region of space. It can be seen that the time for an electron to transit the structure
at resonance, when the transmission probability is high, is about 2 orders of
magnitude longer than the time required for a classical electron having the same
kinetic energy to transit the same region. This result shows that there is storage
of on-resonance wavevectors in the potential well.


Transmission through a double barrier 



Fig. 3.26 a Transmission probability for an electron incident on a double potential barrier of height 2 eV 
and total width equal to 2 nm. b The phase tunneling time can be calculated as a function of kinetic energy 
from (3.51) presuming a perfectly monochromatic electron. This result shows that a non-resonant electron 
will tunnel through the barrier in a time that is about three orders of magnitude shorter than an electron that 
tunnels on resonance. In between we show the time for an electron to traverse the same region of space 
when no barrier is present 


On the other hand, according to the phase time model, off-resonance 
tunneling results in a phase tunneling time that is shorter than that of classical 
transport, although the transmission probability is much lower. The electrons 
whose energy is resonant with the discrete eigenstates in the well will be filtered 
out of a wavepacket, and transmitted with a significant delay compared to 
electrons with off-resonance energies. As a result, significant reshaping of the
wavepacket occurs.

This simple model shows some of the important features of resonant 
tunneling, but since the electron is modeled as a delocalized wave, the physical 
meaning of the phase tunneling time is not clear. Tunneling arrival time is 
fundamentally a statistical process, and analysis by continuum mathematics is 
certain to be inadequate. 

An alternative to the analytic approach above is to discretize the Schrödinger
equation for the wavepacket and follow the evolution of the wavepacket in space
and in time as it interacts with the resonant tunneling barrier.

(3.52)

The solution method is often abbreviated as FDTD, for finite-difference
time-domain. The details of the method are beyond the scope of this text. The result
of this approach is a simulation of wavepacket propagation. The method itself is 
enabled by access to modern computer technology which was not available to 
the physicists who developed the phase time delay analysis. The accuracy of the 
solution is limited only by the available quantity of computer memory. 
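
To give a flavor of the approach, the following is a minimal sketch of a finite-difference time-domain propagation, not the simulation used for the figures in this section. It assumes dimensionless units with $\hbar = m = 1$, a leapfrog scheme in which the real and imaginary parts of the wavefunction are staggered by half a time step, and a single thin barrier in place of the double-barrier structure. The grid size, barrier height, wavepacket parameters and function names are all hypothetical choices made to keep the example short.

```python
import math

def apply_h(f, v, dx):
    """Apply H = -(1/2) d2/dx2 + V on a grid (hbar = m = 1); end points are held fixed."""
    c = 0.5 / (dx * dx)
    out = [0.0] * len(f)
    for j in range(1, len(f) - 1):
        out[j] = -c * (f[j + 1] - 2.0 * f[j] + f[j - 1]) + v[j] * f[j]
    return out

def propagate(v, dx=1.0, dt=0.4, steps=1000, x0=100.0, sigma=15.0, k0=0.5):
    """Leapfrog time stepping of a Gaussian wavepacket launched toward +x.

    Returns the probability density on the grid after `steps` time steps.
    """
    xs = [j * dx for j in range(len(v))]
    env = [math.exp(-((x - x0) ** 2) / (2.0 * sigma ** 2)) for x in xs]
    norm = math.sqrt(sum(e * e for e in env) * dx)
    re = [e * math.cos(k0 * x) / norm for e, x in zip(env, xs)]
    im = [e * math.sin(k0 * x) / norm for e, x in zip(env, xs)]
    # The imaginary part is kept half a time step ahead of the real part.
    im = [i - 0.5 * dt * h for i, h in zip(im, apply_h(re, v, dx))]
    im_old = im
    for _ in range(steps):
        re = [r + dt * h for r, h in zip(re, apply_h(im, v, dx))]   # dR/dt = +H I
        im_old = im
        im = [i - dt * h for i, h in zip(im, apply_h(re, v, dx))]   # dI/dt = -H R
    # Density built from the staggered values so that total probability is conserved.
    return [r * r + a * b for r, a, b in zip(re, im_old, im)]

def fraction_right(density, dx=1.0, edge=204):
    """Probability found to the right of grid index `edge`."""
    return sum(density[edge:]) * dx

n = 400
free = [0.0] * n                                             # no obstacle
barrier = [0.2 if 200 <= j < 204 else 0.0 for j in range(n)]  # thin barrier above the mean energy

p_free = propagate(free)
p_tunnel = propagate(barrier)
print("probability beyond the barrier position, unimpeded:", fraction_right(p_free))
print("probability beyond the barrier position, tunneling:", fraction_right(p_tunnel))
```

The time step is chosen well inside the stability limit of the leapfrog scheme for this grid spacing. A practical simulation would use a finer grid, absorbing boundaries and the actual double-barrier potential.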

The results of one such simulation are presented below as an illustration of
three interesting cases: (i) the propagation of an unimpeded wavepacket, (ii)
a wavepacket with a k-vector distribution centered on a resonant state of the
barrier, and (iii) a wavepacket with a k-vector distribution centered in between
resonant states of the barrier (referred to as off-resonance).

The diagram in Fig. 3.27 shows the magnitude of the wavepacket as a
function of distance. The two-barrier structure is located near the middle of the
diagram between 580 and 600 Å. Three results are superimposed: (i) an
unimpeded wavepacket (blue curve), (ii) a wavepacket with its k-vector distribution
centered on a resonant tunneling level (green curve) and (iii) a wavepacket with its
k-vector distribution centered off-resonance between resonant tunneling
levels (red curve). The wavepackets are launched toward the double barrier with
the same group velocity of the wavepacket peak. Each curve
represents the spatial distribution of a wavepacket of a single electron, and so the
total area under each wavepacket is unity. It is evident that the leading edge of
the unimpeded wavepacket (blue) will be detected before the arrival of the
other two wavepackets.









Fig. 3.27 Wavepacket magnitude as a function of distance. Each wavepacket has been launched toward the
right with the same mean velocity. The blue-colored wavepacket is propagating toward the right without
any obstacle in its path. The red and green wavepackets encounter a double tunneling barrier, located
between 580 and 600 Å. The green wavepacket has its k-vector distribution centered on a resonant level of
the double-barrier structure. The red wavepacket has its k-vector distribution centered between two resonant
levels. Significant components of the resonant (green) and off-resonant (red) wavepackets are reflected on
incidence with the barrier and are moving in the left direction. The figure shows a “snapshot” of the
magnitudes of each wavepacket, taken at the same moment in time


Both the on-resonance and off-resonance wavepackets have transmitted and
reflected components. As expected, there is a significant portion of the
on-resonance wavepacket that is confined to the potential well in the interior of the
two-barrier structure. The magnitude of this confined or stored portion depends,
through the boundary conditions, on the magnitude of the wavefunction exterior
to the double barrier. As the transmitted and reflected free-electron waves move
away from the double barrier, the magnitude of the stored wavepacket will
diminish near the barrier. To maintain the boundary conditions, the magnitude of
the wavepacket inside the barrier must also decrease with time. Thus the storage
of the wavepacket components is temporary, supporting the concept of a
dwell-time introduced in the phase-delay analysis.

The off-resonance wavepacket is almost entirely reflected, but there is a
transmitted part which can be seen near 740 Å. Storage of the off-resonance
wavepacket inside the double barrier is negligible because the wavepacket
contains relatively few on-resonance k-vector components (Fig. 3.28).










Fig. 3.28 Enlargement of the transmission region of Fig. 3.27. In this simulation, the unimpeded 
wavepacket (blue) will be detected first at any chosen arrival rate criterion. If the detection threshold can be 

set sufficiently low, (i.e. y < 10 ^ in this example) both the resonant and non-resonant wavepackets can be 
detected. Under this condition, it is not possible in this simulation to distinguish between the arrival times of 
the resonant and off-resonant wavepackets. If the detection criterion is set higher, for example, at y = 0.5 x 

10 , then the unimpeded (blue) and resonant (green) wavepackets can be detected. However, the non-resonant
wavepacket (red) would not be detected. The result of such a measurement would be that the
unimpeded wavepacket arrives well ahead of the resonant wavepacket, but that the off-resonant wavepacket 
is totally reflected 


3.8.5 Tunneling Time: Some Conclusions 

In our treatment the wavefunction is either transmitted through or reflected by
the potential barrier. At all times, the sum of the reflection coefficient and
transmission coefficient is equal to unity. If the probability of reflection
is non-zero, then the probability of transmission is less than unity.
The transmission coefficient of a tunneling particle is therefore less than that of a
quantum that encounters no barrier. This implies that if we set up a detection station at any point in
space to detect the arrival rate of quanta from an unscattered beam, and a beam
that is scattered by a tunneling barrier, we will count a higher arrival rate of
quanta from the unscattered beam. Otherwise stated, we will have to wait for a
longer time to detect the same number of quanta from the beam of tunneling
particles compared to the case for the beam of unscattered particles. In this
simple model, the transit time of the particle is retarded by tunneling relative to
the situation where the potential barrier is absent.



















In our model we have represented the quantum by a wave packet as a 
summation of individual free-electron wavefunctions, each represented by its 
characteristic wavevector. The Schrödinger equation that we have used is a
linear differential equation, and as such, it treats each wave vector component of 
the wavepacket independently of the others. The important result of this 
mathematical situation is that there is no interaction between different 
wavevector components of the wavepacket. 

The FDTD simulation permits the visualization of the time-evolution of this 
wavepacket, according to the mathematics of the Schrödinger equation. This
simulation shows that the wavepacket propagation is dispersive, with the larger 
wavevector components advancing faster than the smaller wavevector 
components. This is the case for both the wavefunction of the quantum that 
proceeds by tunneling and the wavefunction of the unscattered quantum. Thus, if 
we set up a detection station at any point in space to detect the arrival rate of 
quanta, the wavepacket components with the largest wavevectors will arrive 
first. If the reflection coefficient for the tunneling wavepacket is greater than 
zero, then the transmission coefficient will be less than that of an unscattered 
wavepacket. At any detector position, we will count a higher arrival rate of 
quanta from the unscattered beam. 

The result of using Schrödinger’s equation for calculation of the transmission
coefficient of quantum tunneling of a wavepacket through a potential barrier is 
similar to the action of a linear filter. Some wavevector components of the 
wavefunction are transmitted with a higher amplitude than others. All the 
wavevector components of the unscattered wave are transmitted with a higher 
amplitude than those of the tunneling wavepacket. Thus, regardless of the initial 
shape of the wavepacket or the spatial complexity of the barrier, a quantum that 
propagates by tunneling through a potential barrier will be retarded relative to a 
quantum that does not encounter such a barrier. 


3.9 Summary 

In this chapter we treat quanta having a non-zero mass, such as the electron, as a
plane wave. The Schrödinger equation is a scalar differential equation that is
used to calculate the stationary states of quanta, and we have explored a number
of solutions of this equation in one dimension.

Physical confinement of a quantum particle translates into potential energy.
The dimensions of the confinement impose an upper limit on its wavelength and
thus a minimum wavevector. The energy associated with a quantum with a
non-zero mass is proportional to the square of its wavevector, and so confinement
endows the electron with a minimum energy.

Tunneling of quantum particles through a potential barrier is an astonishing
and real feature of quantum behavior. Solutions of the Schrödinger equation
show that the probability of finding a single quantum is non-zero on both sides
of the barrier. Both reflection and transmission are possible. However, a quantum
is indivisible, and in a measurement, only one or the other result can be
observed. Thus the wavefunction collapses to the eigenstate of the
measurement, and it must collapse instantaneously in order to conserve mass
(the proof of which is the subject of Exercise 3.1).

Tunneling behavior can be described by calculation of the time-independent 
transmission coefficient. The transmission coefficient for two adjacent barriers is 
characterized by resonant energy levels where the transmission coefficient can 
reach unity. The transmission coefficient for three or more such barriers in 
tandem shows energy broadening of these resonant levels into energy bands. 

Electron tunneling is used to make commercial photonic devices such as 
lasers and detectors which are capable of high-speed operation. The question of 
tunneling time has both fundamental and practical importance. Modeling the 
electron by a simple plane wave can be used successfully to calculate the stable 
energy states. However, since such a wavefunction is delocalized spatially, it 
cannot tell us about the time required to transit a tunneling structure. To 
approach this question, we have modeled the quantum as a wave packet, having 
a distribution of k-vectors, so that its spatial position can be determined. 
Numerical simulations using such wavepackets in the time-dependent 
Schrödinger equation show that quanta are retarded by passage through a
tunneling barrier. 


Exercises 

3.1 The application of Schrödinger’s equation to one-dimensional electron
tunneling through a potential barrier shows that the electron wavefunction 
is non-zero on both sides of the potential barrier. However, experiment 
shows that the electron is either entirely on one side of the barrier or the 
other. The Copenhagen interpretation of quantum mechanics postulates that 
measurement causes the collapse of the wavefunction. Show that 
wavefunction collapse must occur instantaneously in space (and not at the 
speed of light), if matter is to be conserved. 




3.2 See Fig. 3.29

Fig. 3.29 Energy-distance diagram for an infinitely deep, 1-dimensional potential well of width L


Solve Schrödinger’s equation and determine:


(a) quantization conditions 

(b) energies of the stable states 

(c) normalized wavefunctions 

Compare your solution to that developed in Sect. 3.1.3. Which features 
are simpler? Which features are obscured? Which approach do you prefer? 


3.3 Using a diagram of a wavefunction confined in an infinitely-deep potential
well, demonstrate that the first-order Stark shift is zero, i.e.
$\Delta E_1 = q\mathcal{E}\langle x \rangle = 0$, using a symmetry argument.










3.4 A double heterostructure of Al$_x$Ga$_{1-x}$As/GaAs forms two quantum wells,
one in the conduction band and the other in the valence band, as shown in
Fig. 3.30. Show that the Stark shift lowers $E_{gap}$, the minimum energy
difference between the valence band and the conduction band states. This
voltage-dependent shift in $E_{gap}$ can be applied to make a high-speed
absorption modulator for communications lasers.

Fig. 3.30 A schematic diagram of a quantum well structure of Al$_x$Ga$_{1-x}$As/GaAs, prior to
applying an electric field


3.5 See Fig. 3.31

Fig. 3.31 A one-dimensional, infinite periodic potential, known as the Kronig-Penney model

The Kronig-Penney model treats the transport of electrons through a
one-dimensional infinite periodic potential. The result of the analysis gives a
relationship between electron energy $\varepsilon$ and electron wavevector $q$:

$\cos(qd) = \cos(\alpha(d - s))\cos(\beta s) - \frac{1}{2}\left(\frac{\alpha m^*_\beta}{\beta m^*_\alpha} + \frac{\beta m^*_\alpha}{\alpha m^*_\beta}\right)\sin(\alpha(d - s))\sin(\beta s)$ (3.53)

(see Morrison et al. 1976, p. 465)
where:

d is the period

s is the width of the well region

$\alpha$ and $\beta$ are the wavevectors in the barrier region (width $d - s$) and in the well region (width $s$), with effective masses $m^*_\alpha$ and $m^*_\beta$; $\alpha$ can be a complex variable



(a) Use (3.53) to plot wavevector versus energy. Show that the allowed 
energy states form bands, separated by energy gaps where electron 
states do not exist. 

(b) In Sect. 3.7, we showed that tunneling through a sequence of barriers 
and wells also gives rise to energy bands. By numerical simulation, 
compare the energy dependence of the transmission windows for 
sequential tunneling through a 4-barrier, 3-well structure, with the 
energy bands of the Kronig-Penney model having the same 
dimensions. Use GaAs and Al$_{0.3}$Ga$_{0.7}$As for the materials:

$V_0 = 0.35$ eV

$m^*_{GaAs} = 0.066\,m_0$

$m^*_{AlGaAs} = 0.092\,m_0$

3.6 The expression for the transmission coefficient for tunneling through a 1- 
dimensional potential barrier is T - 16>^"' 2vjL (^j) (see 3.39) show that the 

phase time delay is given by: = ^fr. 

This expression is known as the Larmor tunneling time . 


Appendix: Source Code for Calculation of Sequential Tunneling Using MATLAB


%%%%%%%%% Sequential tunneling %%%%%%%%%%%

% In addition to this file there is a second file "mat1.m".
% In order to run, both files are to be in the directory in
% which you invoke Matlab.

% This program plots the transmission coefficient versus energy for a
% sequence of potential barriers of the same height vo, the same
% thickness, d2-d1 = 5 nm, and the same periodicity of 15 nm.
% These parameters can be changed to suit the example under
% consideration.

% Initial conditions and definitions
% Initial energy in eV
Xa1 = 0.01;
% Step in eV
st = 0.00003;
% Final energy in eV
Xa2 = 0.3;
pe = Xa1:st:Xa2;
y = zeros(size(pe));
h = 1/1.05e-34;      % 1/hbar, in 1/(J s)
mo = 9.11e-31;       % free-electron mass in kg

% Barrier height in eV, then converted to joules
vo = 0.25;
vo = vo*1.6e-19;

% Particle effective mass (well, barrier, well)
m1 = 0.066;
m2 = 0.092;
m3 = m1;
m1 = m1*mo;
m3 = m3*mo;
m2 = m2*mo;

for k = 1:length(pe)
    E = pe(k)*1.6e-19;
    k1 = h*sqrt(2*m1*E);
    k2 = h*sqrt(2*m2*(E-vo));   % imaginary when E < vo
    k3 = k1;
    re1 = (k2*m1)/(k1*m2);
    re2 = (k3*m2)/(k2*m3);
    d1 = 0;
    d2 = 5e-9;

    % initial calculation to set mt3
    mt1 = mat1(vo,re1,k1,k2,d1);
    mt2 = mat1(vo,re2,k2,k3,d2);

    % cascade the two interfaces of the first barrier
    mt1 = mt1*mt2;
    mt3 = mt1;

    % next barriers: nb counts the additional barriers
    for nb = 1:49
        d1 = d1+15e-9;
        d2 = d2+15e-9;
        mt1 = mat1(vo,re1,k1,k2,d1);
        mt2 = mat1(vo,re2,k2,k3,d2);

        % cascade the barriers
        mt1 = mt1*mt2;
        mt3 = mt3*mt1;
    end

    y(k) = (abs(1/mt3(1,1)))^2;
end

% Set the axis limits after plotting so that they are not reset by plot
plot(pe,log10(y));
axis([0 0.3 log10(1e-8) 0]);

% This subroutine computes the transmission matrix from one medium
% to another. It is needed for the sequential tunneling program.
% Save it in the file "mat1.m".

function [themt1] = mat1(Vo,R,k1,k2,x)
A1 = k1 + k2;
A2 = k2 - k1;
Row11 = (1 + R) * exp(1j * A2 * x);
Row12 = (1 - R) * exp(-1j * A1 * x);
Row21 = (1 - R) * exp(1j * A1 * x);
Row22 = (1 + R) * exp(-1j * A2 * x);
themt1 = 0.5 * [Row11,Row12; Row21,Row22];

Footnotes

1 Note: Although the BenDaniel-Duke boundary conditions were developed about a half-century ago, this
continues to be an area of current study. For example, the reader is also referred to G.T. Einevoll and L.J.
Sham, Phys. Rev. B 49, 10533 (1994) for an excellent tutorial discussion on this topic.


2 Note: This review is one of the best available treatments of the tunneling time problem in quantum 
mechanics. 


3 Note: The most popular model to calculate the generation rate is Kane’s model, derived from the WKB
method. Kane’s model is derived for a direct gap semiconductor in a uniform electric field and is given
by:

$T(E) = A\,\frac{E^{D}}{\sqrt{E_g}}\,\exp\left(-B\,\frac{E_g^{3/2}}{E}\right)$

where $E$ is the electric field and $E_g$ the bandgap, while $A$ and $B$ are parameters depending on the
effective mass of valence and conduction bands and $D$ takes a default value of 2, but for the sake of
generality it is left unspecified as an adjustable parameter.

Abstract

Electronic bandstructure is the energy-momentum relationship for an electron. 
The behavior of a totally-free electron in a crystal lattice shows the constraints 
that symmetry alone places on the energy-momentum relationship. Electrons in 
semiconductors are not free. Their wavefunctions take the form of Bloch 
functions containing a part that has the periodicity of the crystal structure and a 
free-electron term. The electronic bandstructure of many semiconductors of 
technological importance can be calculated using the pseudopotential method. 
The power of the pseudopotential method resides in its ability to substitute a 
fictitious potential that accounts for the behavior of the electrons in the weakly- 
bound valence band states. A pseudopotential can be calculated from first 
principles if the ground-state configuration of the material can be determined. In 
a spirit similar to that of the pseudopotential method, Pierre Hohenberg and 
Walter Kohn proved that the ground state configuration of an ensemble of atoms, 
such as a crystal or a molecule, is a functional of the correct electron charge 
density: that is, a function of only three spatial dimensions. Density functional 
theory has enabled the determination of the ground-state configuration not only 
of semiconductors whose structure is relatively simple, but more importantly of 
complex molecules and proteins with relevance to biochemistry. 



4.1 Introduction 

The electronic and photonic properties of semiconductors are a direct function of
their electronic energy levels. The wavefunction of the electron in a crystalline
material can be represented by the product of a free-electron term and a term that
has the periodicity of the crystal lattice. The solution to Schrödinger’s equation
using this wavefunction gives the energy levels, and the energy-momentum
relationship of electronic states; that is, the electronic bandstructure, to an
excellent level of accuracy.

In this chapter we investigate the electronic bandstructure of crystalline 
semiconductors. The behavior of a totally free electron in a crystal lattice shows 
the constraints that symmetry alone places on the energy momentum 
relationship. Electrons in semiconductors are not free. Their wavefunctions take 
the form of Bloch functions containing a part that has the periodicity of the 
crystal structure and a free-electron term. A realistic and accurate energy- 
momentum relationship can be calculated for an electron by taking account of 
the periodic potential of the particular semiconductor under study. The electronic 
bandstructure of many semiconductors of technological importance can be
calculated using the pseudopotential method. The power of the pseudopotential
method resides in its ability to substitute a fictitious potential that accounts for 
the behavior of the electrons in the weakly-bound valence band states. 

A pseudopotential can be calculated from first principles if the ground-state 
configuration of the material can be determined. In a spirit similar to that of the 
pseudopotential method, Pierre Hohenberg and Walter Kohn proved that the 
ground state configuration of an ensemble of atoms, such as a crystal or a 
molecule, is a functional of the correct electron charge density: that is, a function 
of three spatial dimensions. Density functional theory has enabled the 
determination of the ground-state configuration not only of semiconductors 
whose structure is relatively simple, but more importantly of complex molecules 
and proteins with relevance to biochemistry. 


4.2 Periodicity 

A crystal is a periodic arrangement of atoms called a lattice. Each point in the
lattice can be represented by a vector $\mathbf{R} = n_1\mathbf{a}_1 + n_2\mathbf{a}_2 + n_3\mathbf{a}_3$, where the $n_i$ are
integers. The $\mathbf{a}_i$ are the set of non-coplanar vectors that form a unit cell of the
lattice. The unit cell contains all the symmetry elements of the crystal structure.
The choice of $\mathbf{a}_i$ is not unique. The sole requirement is that starting from a lattice
point any other point in the lattice can be reached by a linear combination of
these vectors. In practice, one chooses the set of the shortest vectors that meet
this requirement. In Fig. 4.1 we show an example to illustrate this concept for a
simple two-dimensional face-centered lattice.



Fig. 4.1 Some lattice translation vectors for a 2-dimensional face-centred lattice. Note that the choice of 
vectors is not unique 


The Wigner-Seitz cell is a useful concept to describe the area of “unique
space” in the lattice. This cell is bounded on all sides by the perpendicular
bisectors of all the lines that connect the nearest neighbors of a single lattice
point. This procedure is illustrated in Fig. 4.2 for a 2-dimensional face-centred
square lattice.







Fig. 4.2 Construction of the Wigner-Seitz cell for a 2-dimensional face-centred lattice. The cell is bounded 
by the perpendiculars to the lines joining nearest-neighbor lattice points 


The most commonly-encountered crystal lattices in 3-d solid-state materials
are face-centered cubic (fcc), hexagonal (hex) and body-centered cubic (bcc).
Silicon, diamond, germanium, and many III-V semiconductors, such as GaAs,
crystallize in the fcc lattice. Some III-V and II-VI materials, such as GaN, ZnS and ZnSe,
crystallize principally in the hex lattice, but can also be made to crystallize in the
fcc structure. Illustrations of these structures are shown in Fig. 4.3. Each crystal
structure is accompanied by its Wigner-Seitz cell.


















Fig. 4.3 Some examples of Wigner-Seitz cells for 3-dimensional crystal structures: a simple cubic, b
face-centred cubic, c body-centred cubic, d hexagonal. The boundary of each cell is determined by the planes
that bisect the lines that join the atomic positions that make up the crystal lattice structure

The Wigner-Seitz cell in three dimensions is constructed in an analogous 
manner. It is the volume bounded by the perpendicular planes that bisect the 
lines that connect nearest neighbors in the crystal lattice. 


4.3 Periodicity and Electron Energy States 

By definition, the physical environment at each lattice point is identical to that of
any other lattice point. Thus any observable physical property, like energy or
charge density, must also have the same symmetry as the lattice. In order to make
this statement more quantitative, we introduce a translation operator $T_{\mathbf{R}}$ which
translates a function by the primitive translation vector $\mathbf{R}$.

$T_{\mathbf{R}} f(\mathbf{r}) = f(\mathbf{r} + \mathbf{R})$

Starting with the Schrödinger equation:

(4.1)

(4.2)

That is, the energy eigenvalues of the Schrödinger equation are the same
(invariant) when the wavefunction is translated by the periodicity of the lattice.
The charge density is given by the squared amplitude of the wavefunction. This
implies that the wavefunctions of the initial and the translated state are the same
within a phase factor of unity magnitude,

i.e. $T_{\mathbf{R}_j}\Psi(\mathbf{r}) = \Psi(\mathbf{r} + \mathbf{R}_j) = e^{i\mathbf{k}\cdot\mathbf{R}_j}\,\Psi(\mathbf{r})$ (4.3)

Equation 4.3 is also known as Bloch’s theorem.

The electronic properties of semiconductors are determined by the wavelike
properties of delocalised electrons and holes. We can separate the wavefunction
into two parts, one with the periodic behavior of the crystal lattice, and the other
with wavelike behavior.

$\Psi(\mathbf{k}, \mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\,u(\mathbf{k}, \mathbf{r})$ (4.4)

Substituting this form into (4.3) gives

$\Psi(\mathbf{k}, \mathbf{r} + \mathbf{R}_j) = e^{i\mathbf{k}\cdot(\mathbf{r} + \mathbf{R}_j)}\,u(\mathbf{k}, \mathbf{r} + \mathbf{R}_j) = e^{i\mathbf{k}\cdot\mathbf{R}_j}\,\Psi(\mathbf{k}, \mathbf{r})$ (4.5)

where we have used the periodic property of $u(\mathbf{k}, \mathbf{r})$.

Wavefunctions of the form of (4.4) are called Bloch functions.

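Electronic states in crystals are defined by energy and momentum ($\mathbf{p} = \hbar\mathbf{k}$). We
can now show that there is a set of basis vectors in k-space, which are also
periodic:

$\Psi(\mathbf{k} + \mathbf{G}_m, \mathbf{r} + \mathbf{R}_j) = T_{\mathbf{R}_j}\Psi(\mathbf{k} + \mathbf{G}_m, \mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{R}_j}\,e^{i\mathbf{G}_m\cdot\mathbf{R}_j}\,\Psi(\mathbf{k} + \mathbf{G}_m, \mathbf{r})$ (4.6)

Bloch’s theorem holds:

$\Psi(\mathbf{k} + \mathbf{G}_m, \mathbf{r} + \mathbf{R}_j) = e^{i\mathbf{k}\cdot\mathbf{R}_j}\,\Psi(\mathbf{k} + \mathbf{G}_m, \mathbf{r})$, provided that $\mathbf{G}_m\cdot\mathbf{R}_j = 2\pi n$, where $n$ is an integer.

We can construct the vectors of k-space using the condition of (4.6):

$\mathbf{G}_m = m_1\mathbf{b}_1 + m_2\mathbf{b}_2 + m_3\mathbf{b}_3$

where $\mathbf{b}_1$, $\mathbf{b}_2$ and $\mathbf{b}_3$ are by analogy the primitive vectors of the lattice in
k-space.

The periodicity condition requires that the vector $\mathbf{b}_1$ be perpendicular to
both $\mathbf{a}_2$ and $\mathbf{a}_3$, and similarly for $\mathbf{b}_2$ and $\mathbf{b}_3$.

$\mathbf{b}_1 = \frac{2\pi}{V}(\mathbf{a}_2 \times \mathbf{a}_3)$ (4.7a)

$\mathbf{b}_2 = \frac{2\pi}{V}(\mathbf{a}_3 \times \mathbf{a}_1)$ (4.7b)

$\mathbf{b}_3 = \frac{2\pi}{V}(\mathbf{a}_1 \times \mathbf{a}_2)$ (4.7c)

where $V = \mathbf{a}_1\cdot(\mathbf{a}_2 \times \mathbf{a}_3)$ is the volume of the Wigner-Seitz unit cell.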

As an important example, we will consider the reciprocal lattice for Si, Ge
and other semiconductor crystals. Si, Ge and most III-V semiconductors
crystallize in the face-centered cubic (fcc) lattice structure. A set of real-space
primitive lattice vectors could be:

$\mathbf{a}_1 = \frac{a_0}{2}(1\ 1\ 0)$

$\mathbf{a}_2 = \frac{a_0}{2}(0\ 1\ 1)$ (4.8)

$\mathbf{a}_3 = \frac{a_0}{2}(1\ 0\ 1)$

where $a_0$ is the lattice parameter of the fcc structure.

To find a corresponding primitive vector of the reciprocal lattice, we apply
(4.7a):

$\mathbf{b}_1 = \frac{2\pi}{V}(\mathbf{a}_2 \times \mathbf{a}_3) = \frac{2\pi}{a_0}(1\ 1\ {-1})$, with $V = a_0^3/4$ (4.9)

Similar operations are used to find $\mathbf{b}_2$ and $\mathbf{b}_3$. The result shows that the
reciprocal lattice has primitive translation vectors that correspond to a
body-centred cubic structure.
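
The construction of the reciprocal primitive vectors can be checked numerically. The sketch below assumes the fcc primitive vectors $(a_0/2)(1\,1\,0)$, $(a_0/2)(0\,1\,1)$ and $(a_0/2)(1\,0\,1)$ with $a_0 = 1$, and applies the cross-product rule with the primitive-cell volume $V = \mathbf{a}_1\cdot(\mathbf{a}_2\times\mathbf{a}_3)$. The helper names `cross`, `dot` and `reciprocal` are chosen for this illustration.

```python
import math

def cross(u, v):
    """Vector product of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    """Scalar product of two 3-component tuples."""
    return sum(a * b for a, b in zip(u, v))

def reciprocal(a1, a2, a3):
    """Reciprocal primitive vectors b_i = (2*pi/V) * (a_j x a_k), cyclic in (i, j, k)."""
    volume = dot(a1, cross(a2, a3))
    scale = 2.0 * math.pi / volume
    pairs = ((a2, a3), (a3, a1), (a1, a2))
    return tuple(tuple(scale * c for c in cross(u, v)) for u, v in pairs)

a0 = 1.0  # lattice parameter, arbitrary units
a1 = (a0 / 2, a0 / 2, 0.0)
a2 = (0.0, a0 / 2, a0 / 2)
a3 = (a0 / 2, 0.0, a0 / 2)

b1, b2, b3 = reciprocal(a1, a2, a3)
for name, b in (("b1", b1), ("b2", b2), ("b3", b3)):
    # Express each vector in units of 2*pi/a0 for comparison with the text.
    print(name, [round(c * a0 / (2.0 * math.pi), 6) for c in b])
```

In units of $2\pi/a_0$ the three vectors have components of magnitude one with a single sign change each, which is the pattern of the primitive vectors of a body-centred cubic lattice.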


Example: show that the reciprocal lattice of a body-centred cubic structure is 
a face-centered lattice. 

In units of $2\pi/a_0$, some of the points of the body-centered reciprocal lattice are, in order of length:

$$\begin{aligned}
G_0 &= (0\ 0\ 0)\\
G_3 &= (1\ 1\ 1)\\
G_4 &= (2\ 0\ 0)\\
G_8 &= (2\ 2\ 0)\\
G_{11} &= (3\ 1\ 1)\\
G_{12} &= (2\ 2\ 2)
\end{aligned} \tag{4.10}$$

where the subscript on G refers to the square of the vector length. In general there are multiple vectors for a given length. For example, there are 12 distinct G-vectors that have squared length 8. The number of distinct vectors in each group is:

$$G_0: 1, \qquad G_3: 8, \qquad G_4: 6, \qquad G_8: 12, \qquad G_{11}: 24, \qquad G_{12}: 8 \tag{4.11}$$

Note that some lattice points such as (1 0 0) or (1 1 0), which exist in the real space lattice, are not present in the reciprocal lattice.
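The groups of reciprocal lattice vectors and their multiplicities can be enumerated directly. The sketch below assumes the conventional cubic description of the bcc reciprocal lattice, in which a point (h k l), in units of 2π/a_0, belongs to the lattice when its three indices are all even or all odd. The function name `bcc_points` and the index cutoff are illustrative choices.

```python
from collections import Counter
from itertools import product


def bcc_points(max_index=3):
    """Integer triples (h, k, l), in units of 2*pi/a0, with all-even or all-odd indices."""
    indices = range(-max_index, max_index + 1)
    return [g for g in product(indices, repeat=3)
            if len({c % 2 for c in g}) == 1]


# Group the points by squared length and count the members of each group.
multiplicity = Counter(sum(c * c for c in g) for g in bcc_points())
first_groups = {n: multiplicity[n] for n in (0, 3, 4, 8, 11, 12)}
print(first_groups)
```

The counts for squared lengths 0, 3, 4, 8 and 11 add up to 51, the number of vectors used in the bandstructure calculation of Sect. 4.5. Points such as (1 0 0) and (1 1 0) fail the parity test and are absent from the list.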


4.4 Brillouin Zones 

The lattice in reciprocal or k-space (k has units of inverse length) is periodic, and so we can identify a unit cell, called the Brillouin zone, that is analogous to the Wigner-Seitz cell in real space. Leon Brillouin was a classmate of Louis de Broglie and authored many key texts in solid-state physics, including the landmark study of wave propagation in periodic structures (Brillouin 1946). The importance of the Brillouin zone concept can be appreciated by considering the relationship between momentum ($p = \hbar k$) and energy ($E = \hbar^2 k^2/2m$) for a free electron in a periodic 1-dimensional environment, as diagrammed in Fig. 4.4. Starting from the point k = 0, we have plotted energy as a function of momentum in Fig. 4.4a. Because the structure is periodic in k-space, all the lattice points are equivalent, and we can plot out the same energy-momentum relationship starting from any one of them. This is illustrated in Fig. 4.4b for the case of a simple one-dimensional lattice. Finally, it is clear that all the unique information about energy and momentum is contained in a region indicated by the boundaries shown in Fig. 4.4c. This region is the Brillouin zone. One could identify many such zones in the structure shown in Fig. 4.4. Because of periodicity, each one contains all the essential information (Jones 1975).



Fig. 4.4 A simple 1-dimensional periodic structure in k-space (horizontal axis: wavevector). a The energy-momentum relationship for a free particle is parabolic. The vertical lines indicate wavevectors that are displaced from the origin by a reciprocal lattice vector $G = 2\pi/a$. b In the environment of a crystal lattice, this parabolic relationship repeats with the period of the reciprocal lattice: $2\pi/a$. c Note that the Brillouin zone can be identified readily as a region of width $2\pi/a$ delimited by the bold-face vertical boundaries in Fig. 4.4b


One could say that the energy-momentum relationship in a single Brillouin zone is the result of folding the diagram of Fig. 4.4b so that the adjacent portions overlap each other with the periodicity of the reciprocal lattice. However, such zone-folding is more than a geometric artifice. It is clear from Fig. 4.4c that periodicity imposes that for any given value of electron momentum there are multiple possible energy values:

$$E_0(k_x) = \frac{\hbar^2 k_x^2}{2m}, \qquad E_1(k_x) = \frac{\hbar^2 (k_x \pm G_1)^2}{2m}, \qquad E_2(k_x) = \frac{\hbar^2 (k_x \pm G_2)^2}{2m}, \ \text{etc.} \tag{4.12}$$


These multiple values are indicated in Fig. 4.4d. The calculation of the 
electron energy levels in real crystals follows this route, as will be demonstrated 
in Sect. 4.5. 
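Zone folding in one dimension can be illustrated with a few lines of code. This is a minimal sketch under stated assumptions: the wavevector is expressed in units of 2π/a, so that the reciprocal lattice vectors are the integers n, and the energy is expressed in units of ħ²(2π/a)²/2m. The function name `folded_energies` and the cutoff `n_max` are illustrative.

```python
def folded_energies(k, n_max=2):
    """Free-electron energies at reduced wavevector k in a 1-D lattice.

    k is in units of 2*pi/a and energies are in units of hbar^2 (2*pi/a)^2 / (2m),
    so the branch displaced by the reciprocal lattice vector G_n = n is (k + n)^2.
    """
    return sorted((k + n) ** 2 for n in range(-n_max, n_max + 1))


zone_center = folded_energies(0.0)  # k = 0
zone_edge = folded_energies(0.5)    # k = pi/a, the Brillouin zone boundary
print(zone_center)
print(zone_edge)
```

At the zone center the branches from +G_n and -G_n coincide, and at the zone boundary neighbouring branches cross. These are the degeneracies that appear when the parabolas of Fig. 4.4b are folded into a single zone.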


Example: The role of symmetry in 3 dimensions, the free electron solution 
using parabolic equations 

The fcc crystal structure is of primary importance for solid-state photonics because it is the crystal structure for a large number of photonic semiconductors such as Si, Ge, GaAs, InAs, CdTe and ZnS, to name a few. In (4.7) we have shown that the reciprocal lattice of the fcc structure is a body-centred cubic lattice. The Brillouin zone for the bcc structure is identical in shape to the Wigner-Seitz cell for a body-centered cubic, and it has already been shown in Fig. 4.3d. All that is needed is to specify the boundaries of the Brillouin zone in terms of the lattice parameter $a_0$ of the real-space fcc structure. Recalling that the boundaries of the Brillouin zone are determined by the point of bisection between adjacent points of the reciprocal lattice, we can determine the points along high-symmetry axes using simple geometry.

The centre of the Brillouin zone at $\frac{2\pi}{a_0}(0\ 0\ 0)$ is called the Γ-point. In the (1 0 0) direction, the nearest neighbor in k-space occurs at $\frac{2\pi}{a_0}(2\ 0\ 0)$. This high-symmetry direction is called the Δ-axis. Therefore the boundary of the Brillouin zone occurs at $\frac{2\pi}{a_0}(1\ 0\ 0)$, which is called the X-point. The distance in k-space from Γ to X is $\frac{2\pi}{a_0}$. Along the (1 1 1) direction, called the Λ-axis, the boundary of the Brillouin zone occurs at $\frac{2\pi}{a_0}(\tfrac{1}{2}\ \tfrac{1}{2}\ \tfrac{1}{2})$, called the L-point. The distance in k-space from Γ to L is $\frac{\pi}{a_0}\sqrt{3}$. The Brillouin zone, which has the appearance of a truncated octahedron, is shown in Fig. 4.5, where we have identified the high-symmetry axes and boundary points (Jones 1975).



Fig. 4.5 The Brillouin zone for the fcc real-space lattice is an octahedron, truncated along the (100) directions. The Γ-point is located at the center of the zone and has the highest symmetry (Madelung 1981)


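With this information we can calculate the energy-momentum relationship for a free electron, subject only to the condition of motion in a lattice with the periodicity of an fcc crystal structure, but having a uniform potential throughout. The energy-momentum relationship has been given in (4.12):

$$E_n(\mathbf{k}) = \frac{\hbar^2 (\mathbf{k} + \mathbf{G}_n)^2}{2m}$$

For $\mathbf{G}_0 = \frac{2\pi}{a_0}(0\ 0\ 0)$, and k along the Δ-axis,

$$E_0(\mathbf{k}_\Delta) = \frac{\hbar^2 k_\Delta^2}{2m}, \quad \text{from } \mathbf{k}_\Delta = \frac{2\pi}{a_0}(0\ 0\ 0) \text{ to } \mathbf{k}_\Delta = \frac{2\pi}{a_0}(1\ 0\ 0)$$

When $\mathbf{k}_\Delta = \frac{2\pi}{a_0}(0\ 0\ 0)$, $E_0(\mathbf{k}_\Delta) = 0$.

For $\mathbf{k}_\Delta = \frac{2\pi}{a_0}(1\ 0\ 0)$, at the X-point, $E_0(\mathbf{k}_\Delta) = \frac{2\hbar^2\pi^2}{m a_0^2}$.

For $\mathbf{G}_3 = \frac{2\pi}{a_0}(\pm 1\ \pm 1\ \pm 1)$, and k along the Δ-axis, there are 8 calculations, one corresponding to each $\mathbf{G}_3$-vector. As before:

$$E_3(\mathbf{k}_\Delta) = \frac{\hbar^2 (\mathbf{k}_\Delta + \mathbf{G}_3)^2}{2m}, \quad \text{from } \mathbf{k}_\Delta = \frac{2\pi}{a_0}(0\ 0\ 0) \text{ to } \mathbf{k}_\Delta = \frac{2\pi}{a_0}(1\ 0\ 0)$$

For $\mathbf{k}_\Delta = \frac{2\pi}{a_0}(0\ 0\ 0)$, we are at the center of the Brillouin zone, and

$$E_3(\mathbf{k}_\Delta) = \frac{6\hbar^2\pi^2}{m a_0^2} \quad \text{(8-fold degenerate)}$$

For $\mathbf{k}_\Delta = \frac{2\pi}{a_0}(1\ 0\ 0)$:

$$E_3(\mathbf{k}_\Delta) = \frac{12\hbar^2\pi^2}{m a_0^2} \quad \text{(4-fold degenerate) for } \mathbf{G}_3 = \frac{2\pi}{a_0}(+1\ \pm 1\ \pm 1)$$

and,

$$E_3(\mathbf{k}_\Delta) = \frac{4\hbar^2\pi^2}{m a_0^2} \quad \text{(4-fold degenerate) for } \mathbf{G}_3 = \frac{2\pi}{a_0}(-1\ \pm 1\ \pm 1)$$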

The cubic symmetry of the Brillouin zone splits the band into two parts. At 

the zone center the energy level is 8-fold degenerate. At the X-point, there are 

two bands, each of which is 4-fold degenerate. 

Next we would use the set of $\mathbf{G}_4$ vectors,

$$\mathbf{G}_4 = \frac{2\pi}{a_0}(\pm 2\ 0\ 0), \quad \mathbf{G}_4 = \frac{2\pi}{a_0}(0\ \pm 2\ 0) \quad \text{and} \quad \mathbf{G}_4 = \frac{2\pi}{a_0}(0\ 0\ \pm 2),$$

and calculate the bandstructure along the X-direction as shown above.

We can repeat this procedure along the Λ-direction. For $\mathbf{G}_0 = \frac{2\pi}{a_0}(0\ 0\ 0)$, and k along the Λ-axis,

$$E_0(\mathbf{k}_\Lambda) = \frac{\hbar^2 k_\Lambda^2}{2m}, \quad \text{from } \mathbf{k}_\Lambda = \frac{2\pi}{a_0}(0\ 0\ 0) \text{ to } \mathbf{k}_\Lambda = \frac{2\pi}{a_0}(\tfrac{1}{2}\ \tfrac{1}{2}\ \tfrac{1}{2})$$

For $\mathbf{k}_\Lambda = \frac{2\pi}{a_0}(0\ 0\ 0)$, $E_0(\mathbf{k}_\Lambda) = 0$.

For $\mathbf{k}_L = \frac{2\pi}{a_0}(\tfrac{1}{2}\ \tfrac{1}{2}\ \tfrac{1}{2})$, $E_0(\mathbf{k}_L) = \frac{3\hbar^2\pi^2}{2 m a_0^2}$.

For $\mathbf{G}_3 = \frac{2\pi}{a_0}(\pm 1\ \pm 1\ \pm 1)$, and k along the Λ-axis, as before, there are 8 calculations, one corresponding to each $\mathbf{G}_3$-vector. However, the symmetry along the Λ-axis is different from that along the Δ-axis, and the 8 energy levels are split into 4 bands. At the L-point:

$$E_3(\mathbf{k}_L) = \frac{27\hbar^2\pi^2}{2 m a_0^2}, \quad \text{non-degenerate, for } \mathbf{G}_3 = \frac{2\pi}{a_0}(+1\ +1\ +1)$$

and,

$$E_3(\mathbf{k}_L) = \frac{3\hbar^2\pi^2}{2 m a_0^2}, \quad \text{non-degenerate, for } \mathbf{G}_3 = \frac{2\pi}{a_0}(-1\ -1\ -1)$$

$$E_3(\mathbf{k}_L) = \frac{19\hbar^2\pi^2}{2 m a_0^2}, \quad \text{3-fold degenerate, for } \mathbf{G}_3 = \frac{2\pi}{a_0}(-1\ +1\ +1) \text{ and permutations}$$

and

$$E_3(\mathbf{k}_L) = \frac{11\hbar^2\pi^2}{2 m a_0^2}, \quad \text{3-fold degenerate, for } \mathbf{G}_3 = \frac{2\pi}{a_0}(-1\ -1\ +1) \text{ and permutations.}$$

There are 8 distinct $\mathbf{G}_3$ vectors at the Γ-point, all having the same energy, resulting in 8-fold degeneracy. Moving away from the Γ-point in any direction lowers the symmetry, and lifts the degeneracy to some degree. For example, along the Λ-direction toward the L-point, the lower symmetry environment results in 4 energy-momentum bands. The upper and lower energy bands are non-degenerate, while the two intermediate energy bands are each 3-fold degenerate. On the other hand, along the Δ-direction toward X, more symmetries are preserved, and the 8-fold degenerate state at Γ is split into 2 four-fold degenerate bands.
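The degeneracies quoted for the Γ, X and L points can be reproduced by counting. The sketch below assumes wavevectors in units of 2π/a_0 and energies in units of ħ²(2π/a_0)²/2m, so that each level is simply |k + G|². The names `G3` and `band_energies` are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# The eight (+-1 +-1 +-1) reciprocal lattice vectors, in units of 2*pi/a0.
G3 = list(product((-1, 1), repeat=3))


def band_energies(k):
    """Count free-electron levels |k + G|^2 over the G3 group.

    k is in units of 2*pi/a0; energies are in units of hbar^2 (2*pi/a0)^2 / (2m).
    Returns a Counter mapping each energy to its degeneracy.
    """
    return Counter(sum((kc + gc) ** 2 for kc, gc in zip(k, g)) for g in G3)


half = Fraction(1, 2)
at_gamma = band_energies((0, 0, 0))
at_x = band_energies((1, 0, 0))
at_l = band_energies((half, half, half))
print(dict(at_gamma), dict(at_x), dict(at_l))
```

Multiplying the dimensionless energies by 2ħ²π²/(m a_0²) converts them to the form used in the text. The single 8-fold level at Γ becomes two 4-fold levels at X and four levels with degeneracies 1, 3, 3 and 1 at L.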

A diagram of these free-electron energy bands for the fcc crystal structure is shown in Fig. 4.6 (Jones 1975).

Fig. 4.6 The energy-momentum relationship for a free electron in a 3-dimensional fcc lattice. This diagram applies for any material and shows only the constraints imposed by symmetry


The bandstructure diagram of Fig. 4.6 shows the behavior of the energy-momentum ($p = \hbar k$) relationship for a free electron in a three-dimensional fcc symmetry environment. It is not material-specific. The crystal lattice is “empty” because we have not yet integrated the effect of atomic potential. At the Γ-point at the center of the Brillouin zone, the electronic wavefunctions see the maximum symmetry of the fcc structure: 48 operations (rotation, reflection, inversion) that leave the structure invariant. As the wavevector increases away from the zone center in any direction, the symmetry is decreased, the degeneracy of the Γ-point states is removed and the energy states split up. In Fig. 4.6 these states are plotted for two high-symmetry directions. Some splitting of the states occurs, but the degeneracies are not completely removed. Incorporation of the atomic potential will modify the energy of the electron states. The atomic potential will further reduce symmetries, and in particular those at the zone center, splitting the state into different energy levels. For the semiconductor materials of interest, the splitting creates an energy gap between the eight states at the Γ-point. This energy gap is the fundamental feature of the photonic properties of these materials.


4.5 Electronic Bandstructure of Specific Materials: 
Integrating the Crystal Potential 

4.5.1 Introduction 

Our objective is to determine the relationship between energy and momentum 
for stable states of electrons in a crystalline semiconductor such as silicon, 
gallium arsenide and the related photonic materials. These states are solutions of 
the Schrodinger equation for electrons moving in the periodic potential of the 
constituent atoms. However, we are not interested in the solutions themselves in this
case, but rather in the relationship between the electron wave-vector and energy
that is required by the solutions. The photonic behavior of the material as a
whole can be understood by knowing the energy of the stable states and the 
probability that these states are occupied by an electron (or hole). 

In the case of a silicon atom, there are 14 electrons that surround the nucleus. 
The atomic configuration is: $1s^2\,2s^2\,2p^6$, accounting for 10 core-state electrons, and
4 electrons in the 3s and 3p states. The core-state electrons are tightly bound to 
the nucleus. Their major role in our case is to shield the nuclear potential. When 
silicon atoms bond to form a crystal, the valence-band electrons form s-p hybrid 
orbitals. They are responsible for determining the tetrahedral symmetry of the 
crystal structure. Four of the orbitals are completely filled by electrons and 
constitute the valence band (or bonding state) of the solid. One orbital is empty 
and constitutes the conduction band (or anti-bonding state). A perfect silicon 
crystal at absolute zero temperature would appear to be an insulator with its 
valence states fully occupied and its conduction states completely unoccupied 
and separated from the valence states by an energy gap equal to the energy 
required to break a crystal bond. 

4.5.2 Schrodinger’s Equation in a Periodic Potential 

In principle, one could solve the Schrodinger equation for all the electronic 
states in the crystal. This would prove to be an immense task, and a large number 
of the results would concern core electron states, whereas we are interested only 
in the properties of electrons in the valence-band and conduction band states. A 
more interesting and efficient approach is to describe these electrons with Bloch 
functions in the presence of a periodic potential having the symmetry of the 
crystal lattice. 




Schrodinger’s equation says that the stable states of an electron are those for 
which total energy is conserved: 


$$-\frac{\hbar^2}{2m}\nabla^2\Psi(\mathbf{r}) + U(\mathbf{r})\,\Psi(\mathbf{r}) = E\,\Psi(\mathbf{r}) \tag{4.13}$$


We can use the periodic behavior of the potential to advantage by using 
Fourier analysis to decompose the potential and Schrodinger’s equation into 
Fourier components (Kittel 1986). 

The periodic potential can be rewritten: 


$$U(\mathbf{r}) = \sum_{\mathbf{G}} U_{\mathbf{G}}\, e^{i\mathbf{G}\cdot\mathbf{r}} \tag{4.14}$$

where the G are the vectors of the reciprocal lattice.

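The wave-function states can be expressed,

$$\Psi(\mathbf{r}) = u_{\mathbf{k}}(\mathbf{r})\, e^{i\mathbf{k}\cdot\mathbf{r}} = \sum_{\mathbf{k}} C(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}} \tag{4.15}$$

The periodic nature of $u_{\mathbf{k}}(\mathbf{r})$ is contained in the Fourier components $C(\mathbf{k})$.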
Substituting these results gives: 


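$$\sum_{\mathbf{k}} \frac{\hbar^2 k^2}{2m}\, C(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}} + \sum_{\mathbf{G}} U_{\mathbf{G}} \sum_{\mathbf{k}} C(\mathbf{k})\, e^{i(\mathbf{k}+\mathbf{G})\cdot\mathbf{r}} = E \sum_{\mathbf{k}} C(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}} \tag{4.16}$$

This equation is true for each Fourier component $C(\mathbf{k})$, which gives:

$$\left(\frac{\hbar^2 k^2}{2m} - E\right) C(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}} + \sum_{\mathbf{G}} U_{\mathbf{G}}\, C(\mathbf{k} - \mathbf{G})\, e^{i(\mathbf{k}-\mathbf{G}+\mathbf{G})\cdot\mathbf{r}} = 0 \tag{4.17}$$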


Thus, by Fourier analysis we have transformed a single differential equation 
into a series of linear homogeneous equations, one for each reciprocal lattice
vector. There is a different series for each value of k in the Brillouin zone. This 
shows the tremendous advantage of working in reciprocal (Fourier) space. 
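The smallest non-trivial case of these linear equations keeps only two plane waves, k and k - G, coupled by a single Fourier component of the potential. The following is a minimal sketch under that assumption; the function name `two_wave_energies` and the numerical values are hypothetical and are chosen only to illustrate how a real coupling U opens a gap between two otherwise degenerate free-electron states.

```python
import math


def two_wave_energies(e_k, e_kg, u):
    """Eigenvalues of the 2 x 2 matrix [[e_k, u], [u, e_kg]] for a real coupling u.

    e_k and e_kg are the free-electron energies of the two plane waves;
    the closed form follows from the quadratic secular equation.
    """
    mean = 0.5 * (e_k + e_kg)
    half_gap = math.hypot(0.5 * (e_k - e_kg), u)
    return mean - half_gap, mean + half_gap


# Two degenerate plane waves (energy 1.0, arbitrary units) coupled by u = -0.2.
lower, upper = two_wave_energies(1.0, 1.0, -0.2)
print(lower, upper)
```

When the two free-electron energies are equal, the levels separate by 2|U|. A calculation with many reciprocal lattice vectors follows the same pattern, with a larger matrix and a numerical eigenvalue routine in place of the closed form.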

The solution to these equations will give each of the coefficients $C(\mathbf{k})$. However, we are not interested here in knowing this solution. Organising the equations in matrix form, we can solve for the eigenvalues of the matrix. This procedure gives all the energies that are associated with each value of wave vector k. In so doing we can determine the relationship between $\hbar\mathbf{k}$, the electron momentum, and $E_{\mathbf{k}}$, the energy of the state with this momentum (Hamaguchi 2010).


(4.18)

The calculation proceeds as follows. First all the reciprocal lattice vectors to be used in the calculation are determined. In the example that follows, we will use all reciprocal lattice vectors from (0 0 0) through (3 1 1). There are 51 in all, which is determined by adding the number of reciprocal lattice vectors in (4.11) for $G_0$ through $G_{11}$. Next we write the 51 equations in matrix form. Then we will pick a value for k, the wavevector. The eigenvalues of the matrix can be calculated using efficient existing computational routines, determining the values of energy of the stable states of the system for that particular wavevector. There are 51 energies, but some values will be identical (degeneracies). Next, k is incremented and the calculation is repeated. This procedure, while it appears tedious, can be completed in a few seconds on a personal computer for k vectors lying along the major symmetry axes of the Brillouin zone. The values of $U_G$ are the Fourier components of the potential that is calculated to represent the environment that an outer shell electron would experience. It is called a pseudopotential because it replaces the actual atomic potential and the screening action of the core electrons around the nucleus. As a result, the energies determined by taking the determinant of coefficients apply to the states of the electrons in the conduction and valence bands.

The pseudopotential method can be used when the core electron states shield 
conduction and valence band electrons from the nuclear potential (Cohen and 
Heine 1970). This would be a good approximation for Si, Ge or GaAs, but not so 
obvious for diamond or graphene because the n = 2 shell is not complete (see 
Exercise 4.3). 

The pseudopotential method was developed through the efforts of many 
condensed matter theorists. An excellent review of this work is given in Volume 
24 of the Solid-State Physics series (Heine 1970), and in the monograph by M.L. 
Cohen and J. R. Chelikowsky (1989) cited in the references to this chapter. This 
is highly recommended reading in order to better understand the physics behind 
this important work. 

For a bandstructure calculation of silicon using the first 5 groups of reciprocal lattice vectors $G_0$ through $G_{11}$ there are three non-zero Fourier components of the potential.

The pseudopotential form-factor energies are referenced to $U_0$, which is set to zero for convenience. $U_4$ is zero by symmetry for all fcc crystals. Thus, there are only three parameters needed to determine the pseudopotential needed to perform the calculation. These parameters can be calculated from first principles, but are usually adjusted by using experimental data. The values in Table 4.1 give the bandstructure energies of silicon within a few percent over the entire Brillouin zone, using only the first five groups of reciprocal lattice vectors. Greater accuracy can be obtained by extending the calculation to include more reciprocal lattice vectors at the price of longer computing time.


Table 4.1 Pseudopotential form factors in eV for the first 5 reciprocal lattice vectors of silicon

Reciprocal lattice vector | Squared length in units of (2π/a_0)² | Number of equivalent wavevectors | U_G, silicon pseudopotential component (eV)
G_0                       | 0                                    | 1                                | 0
G_3                       | 3                                    | 8                                | -2.856
G_4                       | 4                                    | 6                                | 0
G_8                       | 8                                    | 12                               | 0.544
G_11                      | 11                                   | 24                               | 1.088
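The assembly of the matrix of coefficients can be sketched as follows. This is an illustration under stated assumptions, not the code of the appendix: the constant ħ²/2m ≈ 3.81 eV·Å² is an approximate value supplied here, the off-diagonal elements are taken as U(G - G′)cos[(G - G′)·τ] with τ = (a_0/8)(1 1 1), following the symmetric part of (4.23), and the function names are hypothetical. The eigenvalues themselves would then be extracted with a numerical routine, which is not included.

```python
import math
from itertools import product

HBAR2_OVER_2M = 3.81                              # eV * Angstrom^2 (approximate, assumed)
A0_SI = 5.43                                      # Angstrom, lattice constant of silicon
FORM_FACTORS = {3: -2.856, 8: 0.544, 11: 1.088}   # eV, non-zero entries of Table 4.1


def g_vectors(max_sq=11):
    """Reciprocal lattice vectors (units of 2*pi/a0) with squared length <= max_sq."""
    points = [g for g in product(range(-3, 4), repeat=3)
              if len({c % 2 for c in g}) == 1 and sum(c * c for c in g) <= max_sq]
    return sorted(points, key=lambda g: sum(c * c for c in g))


def coefficient_matrix(k, gs, a0=A0_SI, form_factors=FORM_FACTORS):
    """Matrix with kinetic energies on the diagonal and U(G - G') off the diagonal.

    k is in units of 2*pi/a0; all matrix elements are in eV.
    """
    scale = HBAR2_OVER_2M * (2 * math.pi / a0) ** 2
    n = len(gs)
    matrix = [[0.0] * n for _ in range(n)]
    for i, gi in enumerate(gs):
        matrix[i][i] = scale * sum((kc + gc) ** 2 for kc, gc in zip(k, gi))
        for j, gj in enumerate(gs):
            if i != j:
                d = tuple(a - b for a, b in zip(gi, gj))
                u = form_factors.get(sum(c * c for c in d), 0.0)
                # Assumed structure factor cos(G.tau), tau = (a0/8)(1 1 1).
                matrix[i][j] = u * math.cos(math.pi * sum(d) / 4)
    return matrix


gs = g_vectors()
matrix = coefficient_matrix((0.0, 0.0, 0.0), gs)
print(len(gs), len(matrix))
```

Setting every form factor to zero leaves a diagonal matrix whose entries are the free-electron energies of Fig. 4.6. With the assumed structure factor, differences of the type (2 0 0) contribute nothing, which is consistent with $U_4$ = 0 in Table 4.1.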


The actual determinant has too many terms to write explicitly here. To give an example, we will write the matrix of coefficients using only the first two groups of lattice vectors, $\frac{2\pi}{a_0}(0\ 0\ 0)$ and $\frac{2\pi}{a_0}(1\ 1\ 1)$. In this example we pick a wavevector on the Δ-axis, $\mathbf{k} = \frac{2\pi}{a_0}(k_x\ 0\ 0)$.

There are two Fourier components of potential, $U_0$ and $U_3$. We can choose $U_0 = 0$. In the case of silicon $U_3 = -2.856$ eV, and the matrix size is 9 × 9, one row for each of the 1 + 8 vectors in the two groups. It is still too large to write explicitly on a single page. To illustrate, we write out the first few terms below:
















(4.19)

where $\mathbf{G}_0 = (0\ 0\ 0)$ in line 1, $\mathbf{G}_3 = (-1\ -1\ -1)$ in line 2, and $\mathbf{G}_3 = (1\ -1\ -1)$ in line 3.

The energies, $E_{\mathbf{k}}$, are the eigenvalues of the matrix. These can be determined from the EIG(M) command in MATLAB in a single step. In the appendix to this chapter, we give a MATLAB code for the solution of the bandstructure for fcc semiconductors, using the first 5 groups of reciprocal lattice vectors. This code has been used to produce the bandstructure diagrams in the following figures.


4.6 Energy Bandstructure of Silicon 

To solve for the eigenvalues of the pseudopotential equations we need to know the lattice constant of silicon and the Fourier components of the pseudopotential.

The lattice constant of silicon at 25 °C is 5.43 Å.

The pseudopotential components are (Cohen and Bergstresser 1966):

$$\begin{aligned}
U_0 &= 0\\
U_3 &= -2.856\ \text{eV}\\
U_4 &= 0\ \text{eV}\\
U_8 &= 0.544\ \text{eV}\\
U_{11} &= 1.088\ \text{eV}
\end{aligned} \tag{4.20}$$

In Fig. 4.7, we show results for the series of 51 eigenvalue calculations along 4 different directions in k-space. In this series we have added the atomic potential in increments of 10% of the full atomic potential to illustrate how this potential modifies the free-electron bandstructure which is determined only by crystal symmetry. The actual electronic bandstructure of silicon is the last figure in the series.

Fig. 4.7 a-k Bandstructure of silicon calculated using the pseudopotential method. In this series of diagrams, the pseudopotential is augmented in 10 steps, showing how the atomic potential modifies the free-electron bandstructure, shown in (a)























The fundamental bandgap develops from splitting of the 8-fold degenerate $G_3$ manifold (Fig. 4.6). Four eigenvalue solutions form the conduction band, and the remaining 4 solutions form the valence band. The pseudopotential does not completely lift all the degeneracy. The energy bandgap of silicon (E = 1.1 eV) is the difference between the maximum energy of the valence band (E = 10.8 eV), which occurs at k = Γ, and the minimum of the conduction band (E = 11.9 eV), which occurs near k = X. The maximum of the valence band energies and the minimum of the conduction band energies do not occur at the same point in k-space, and thus silicon is called an indirect bandgap semiconductor. This indirect bandgap, which occurs at 1.18 eV at room temperature, dominates the optical absorption properties of silicon. Silicon does have a direct bandgap of 3.41 eV, which occurs between the maximum energy of the valence band and the minimum energy of the conduction band at the Γ-point. Although the optical absorption spectrum is determined by transitions across the indirect bandgap, the optical properties of silicon, such as the refractive index and dielectric loss, are determined by optical transitions across the direct bandgap.


4.7 Energy Bandstructure of Germanium 

To solve for the eigenvalues of the pseudopotential equations we need to know 
the lattice constant of germanium and the Fourier components of the 
pseudopotential. 

The lattice constant of germanium at 25 °C is 5.66 Å.

The components of the pseudopotential form factors are (Cohen and Bergstresser 1966):

$$\begin{aligned}
U_0 &= 0\\
U_3 &= -3.128\ \text{eV}\\
U_4 &= 0\ \text{eV}\\
U_8 &= 0.136\ \text{eV}\\
U_{11} &= 0.816\ \text{eV}
\end{aligned} \tag{4.21}$$

In Fig. 4.8, we show the results for a series of 51 eigenvalue calculations along 4 different directions in k-space. In this series we have added the atomic potential in increments of 10% of the full atomic potential to illustrate how this potential modifies the free-electron bandstructure which is determined only by crystal symmetry. The actual electronic bandstructure of germanium is the last figure in the series.

Fig. 4.8 a-k Bandstructure of germanium calculated using the pseudopotential method. In this series of diagrams, the pseudopotential is augmented in 10 steps, showing how the atomic potential modifies the free-electron bandstructure, shown in (a)


The fundamental bandgap develops from splitting of the 8-fold degenerate $G_3$ manifold (Fig. 4.6). The pseudopotential alone does not completely lift all the degeneracy.

The conduction band of germanium has a pronounced local minimum at Γ (k = 0). The lowest lying energy band in the conduction band also has local minima near X and L, whereas in the case of silicon, there is a minimum only at the Δ-point.

The energy bandgap of germanium (E = 0.8 eV) is the difference between the minimum of the conduction band (E = 10.6 eV), which occurs near k = L, and the maximum energy of the valence band (E = 9.8 eV), which occurs at k = Γ. The maximum of the valence band energies and the minimum of the conduction band energies do not occur at the same point in k-space, and thus germanium, like silicon, is an indirect bandgap semiconductor. In germanium there is a direct bandgap of 0.9 eV at k = Γ, which occurs at an energy very close to that of the indirect band gap. However, this is not the least energy of separation between the valence and conduction bands. We will see that in the next case, for gallium arsenide, this situation is reversed, with the direct band gap at k = Γ being the lowest energy between the valence and conduction bands.


4.8 Energy Bandstructure of Gallium Arsenide 

Gallium arsenide (GaAs) is a cubic crystal with fcc symmetry. It belongs to a class of materials known as compound semiconductors because of the two different atoms that form the basis of the unit cell and the chemical composition. Some compound semiconductors are formed by a combination of elements from group III and group V of the periodic table. Other examples of III-V semiconductors are InAs and GaSb. Examples of II-VI semiconductors are ZnSe and CdTe. These materials share a common structure called zinc-blende. The zinc-blende structure can be thought of as two interpenetrating fcc lattices. The diamond cubic structure of Si, Ge and carbon is a special case of the zinc-blende structure in which all the atoms in the unit cell are the same element.

The geometric center of the unit cell is located at the halfway point between the two interpenetrating lattices at $\frac{a_0}{8}(1\ 1\ 1)$, where $a_0$ is the lattice constant of the crystal. In the case of Ge or Si, the atom on either side is the same, of course. In the case of zinc-blende materials, the atoms on either side are different, and the crystal as a result lacks inversion symmetry. The absence of inversion symmetry is a necessary requirement for nonlinear optical behavior. In fact, zinc-blende materials are excellent non-linear materials of significant technological importance. In contrast, non-linear optical behavior is forbidden to the first order by symmetry in Si, Ge and diamond.

The calculation of the energy bandstructure of zinc-blende materials must take into account the reduced symmetry of the structure compared to that of Si or Ge. Basically two similar calculations are needed, one for each sublattice. This can be taken into account in an efficient manner by writing the pseudopotential in two parts, symmetric and anti-symmetric. Denoting the two atomic species by B and C, this gives for the form factor of the potential:

$$V_S(\mathbf{G}) = \tfrac{1}{2}\left(V_B(\mathbf{G}) + V_C(\mathbf{G})\right) \quad \text{and} \quad V_A(\mathbf{G}) = \tfrac{1}{2}\left(V_B(\mathbf{G}) - V_C(\mathbf{G})\right) \tag{4.22}$$

Here it is easily seen that the antisymmetric component vanishes in the case of Si, Ge and diamond where atoms B and C are the same.

The pseudopotential can then be written:

$$V(\mathbf{G}) = V_S(\mathbf{G})\cos(\mathbf{G}\cdot\boldsymbol{\tau}) + i\,V_A(\mathbf{G})\sin(\mathbf{G}\cdot\boldsymbol{\tau}) \tag{4.23}$$

where $\boldsymbol{\tau}$ is the real-space coordinate of an atom in the lattice.

This approach increases the number of elements in the matrix of equations, but the number of unknowns, the $C(\mathbf{k})$, remains the same. The MATLAB routine takes a few additional milliseconds to extract the eigenvalues.
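The combination of symmetric and antisymmetric form factors can be sketched numerically. The example assumes that the atom coordinate is τ = (a_0/8)(1 1 1), so that G·τ = π(h + k + l)/4 for G = (2π/a_0)(h k l). That choice of origin, the dictionary layout and the function name `pseudopotential` are assumptions made for illustration. The form factors are the values listed in (4.24) and (4.25).

```python
import math

# Form factors in eV, keyed by the squared length of G in units of (2*pi/a0)^2.
SYMMETRIC = {3: -3.128, 8: 0.136, 11: 0.816}        # values of (4.24)
ANTISYMMETRIC = {3: 0.952, 4: 0.680, 11: 0.136}     # values of (4.25)


def pseudopotential(g, v_s, v_a):
    """V(G) = V_S cos(G.tau) + i V_A sin(G.tau), assuming tau = (a0/8)(1 1 1).

    g is an integer triple (h, k, l) in units of 2*pi/a0.
    """
    n = sum(c * c for c in g)
    phase = math.pi * sum(g) / 4
    return complex(v_s.get(n, 0.0) * math.cos(phase),
                   v_a.get(n, 0.0) * math.sin(phase))


v_gaas = pseudopotential((1, 1, 1), SYMMETRIC, ANTISYMMETRIC)
v_ge = pseudopotential((1, 1, 1), SYMMETRIC, {})   # identical atoms: no antisymmetric part
print(v_gaas, v_ge)
```

For identical atoms the result is purely real, while for GaAs the matrix elements become complex and satisfy V(-G) = V(G)*, so the matrix of coefficients is Hermitian and its eigenvalues remain real.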

To solve for the eigenvalues of the pseudopotential equations we need to know the lattice constant of GaAs and the Fourier components of the pseudopotential.

The lattice constant of GaAs at 25 °C is 5.66 Å, nearly identical to that of germanium.

The pseudopotential form factors are (Cohen and Bergstresser 1966):

Symmetric

$$\begin{aligned}
U_0 &= 0\\
U_3 &= -3.128\ \text{eV}\\
U_4 &= 0\ \text{eV}\\
U_8 &= 0.136\ \text{eV}\\
U_{11} &= 0.816\ \text{eV}
\end{aligned} \tag{4.24}$$

Antisymmetric

$$\begin{aligned}
U_0 &= 0\\
U_3 &= 0.952\ \text{eV}\\
U_4 &= 0.680\ \text{eV}\\
U_8 &= 0\ \text{eV}\\
U_{11} &= 0.136\ \text{eV}
\end{aligned} \tag{4.25}$$

In Fig. 4.9, we show the results for a series of 51 eigenvalue calculations along 4 different directions in k-space. In this series we have added the atomic potential in increments of 10% of the full atomic potential to illustrate how this potential modifies the free-electron bandstructure which is determined only by crystal symmetry. The actual electronic bandstructure of gallium arsenide is the last figure in the series.

Fig. 4.9 a-k Bandstructure of gallium arsenide calculated using the pseudopotential method. In this series of diagrams, the pseudopotential is augmented in 10 steps, showing how the atomic potential modifies the free-electron bandstructure, shown in (a)


























The fundamental bandgap develops from lifting of the 8-fold degenerate $G_3$ manifold (Fig. 4.6). However, the pseudopotential alone does not completely lift all the degeneracy.

The conduction band of gallium arsenide, like that of germanium, has a pronounced local minimum at Γ (k = 0). The conduction band also has local minima near X and L.

The energy bandgap of gallium arsenide (E = 1.42 eV) is the difference between the minimum of the conduction band (E = 10.3 eV), which occurs at k = Γ, and the maximum energy of the valence band (E = 8.9 eV), which also occurs near k = Γ. Because the maximum of the valence band energy and the minimum of the conduction band energy occur at the same point in k-space, GaAs is called a direct bandgap semiconductor.

In most other respects, the bandstructures of GaAs and Ge are quite similar. This small difference of a few hundred millielectronvolts is all that keeps Ge from being a direct bandgap semiconductor. Because of this fact, there are no conventional Ge-based lasers, whereas lasers made from GaAs are commonplace.


4.9 Materials with Multiple Electrons 

The pseudopotential method shows how the many-electron Schrodinger 
equation, for a material like GaAs or Si, can be replaced by an equation for one 
electron in a pseudopotential. This replacement leads to a significant
simplification in the mathematics required, leading to a set of linear equations
that yields the energy-momentum relationship. The simplification of the Schrodinger
equation has its basis in the observation that the electrons under consideration can be
divided into valence electrons and inner core electrons. The electrons in the 
inner core states are tightly bound to the nucleus, and do not participate in 
bonding, electrical conduction and low-energy (<6 eV) photonic transitions. On 
the other hand they partially screen the nuclear charge from the valence 
electrons. The pseudopotential method is expected to work well for materials 
like metals and semiconductors having at least one complete inner shell. 
Diamond, for example, does not meet this criterion, and an evaluation of the 
bandstructure of diamond calculated by the pseudopotential method is included 
as an exercise at the end of this chapter. 

Pseudopotentials can be determined from first principles from the electronic charge distribution which defines the structure of the material. Finding the equilibrium structure means calculating the ground state energy of the ensemble of atoms under study. This calculation involves accounting for the multi-body interactions between all the electrons and nuclei.

Hohenberg and Kohn (1964) proved that there is a unique charge distribution 
that gives the correct ground state energy. Stated simply: In a system having N 
electrons, the ground state energy does not depend on 3N coordinates of the 
electrons, but rather on the charge distribution, which has only 3 spatial
coordinates. The ground state energy is a functional of the charge density, giving
rise to the name density functional theory. This simplification has
revolutionized the determination of the structure of complex molecules. Next, 
Hohenberg and Kohn proved that the correct ground state electron density 
minimizes this energy functional. 

Almost all materials, including most of the elements on one hand, and 
complex organic molecules on the other, are multi-body systems. Computations 
on such systems represent a considerable challenge to both mathematics and 
physics. Yet, common proteins, active in the human body, know how to solve 
this problem by folding into a minimum energy configuration in a fraction of a 
second. To appreciate the challenge, we shall examine the Hamiltonian for a 
multi-electron system of materials. Such a system is made of M nuclei and N 
electrons. For simplicity, we will assume that the nuclei are frozen in place, and 
consider only the contributions that electrons make to the overall energy. 

First of all there is a component of the total energy coming from the kinetic energy of the electrons:

$$H_{\text{kinetic}} = -\frac{\hbar^2}{2m}\sum_{i=1}^{N}\nabla_i^2 \tag{4.26}$$

There is a term $H_{\text{nuclei}}$ coming from the potential of the nuclei:

(4.27)

There is a term $H_{\text{e-e}}$ coming from the interaction between the electrons themselves:

(4.28)

The Schrodinger equation is:

$$\left(H_{\text{kinetic}} + H_{\text{nuclei}} + H_{\text{e-e}}\right)\Psi = \varepsilon\,\Psi, \tag{4.29}$$

where $\Psi = \Psi(\mathbf{r}_1, \mathbf{r}_2 \ldots \mathbf{r}_N)$ is the electron wave function that depends on each of the coordinates of the N electrons and $\varepsilon$ is the ground-state energy of the system.




However, solving (4.29) is impeded by the fact that this differential equation is not separable, because the electron-electron interaction depends on $1/|\mathbf{r}_i - \mathbf{r}_j|$.

To proceed further, we could propose, as an approximation, to substitute an average potential $V_{\text{av}}(\mathbf{r}_i)$ for each electron in the place of this term. On this basis:

$$H = \sum_{i=1}^{N} h_i \tag{4.30}$$


That is the Hamiltonian can be reduced to the sum of individual 
Hamiltonians for each electron. In this approximation, the wavefunction is 
expressed as the product of the individual one-electron solutions 


hitf/i (rj) = eitffi (ri) 


OT = ^^ l (r,)^ 2 (r 2 )...^(r jV )} 

(4.31 

= (fi| + + ■ ■ ■ + £]\') {<^1 ( r l) $2 ( r :) ■ ■ ■ &N ( r jv) 1 

The proposed wavefunction, 'P = {^1 (ri) ^ 2 ( 1 ^) ■ ■ ■ $n U\v)I , called a Hartree 

product, is not an acceptable choice because its structure does not comply with 
the Pauli exclusion principle , namely that it is not antisymmetric with the 
exchange of electrons. However, another combination of one electron 
wavefunctions could serve as a solution to (4.31) and satisfy the Pauli exclusion 
principle: 
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(4.32) 


Such a wavefunction is called a Slater determinant. A wavefunction of this 
form could be used as a solution for (4.31). Note that it contains N! terms, and each 
term depends on 3 spatial variables per electron as well as spin. If we were to try to solve the 
Schrodinger equation for a cluster of only 10 atoms, we would have to imagine 
dealing with hundreds of electrons, depending on the atomic number of the 
elements in the cluster. If we take the example of silicon, with atomic number Z = 14, this 
means treating 140 electrons, and 560 dimensions including spin. The 
number of terms in the Slater determinant would exceed $10^{241}$. It would require a 
great deal of patience to achieve a result for systems having more than a handful 
of electrons. 
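
The two properties discussed above, antisymmetry under exchange and the factorial growth in the number of terms, can be checked with a short sketch. The three one-dimensional orbitals and the coordinates below are hypothetical choices made only for illustration; they are not solutions of any particular Hamiltonian.

```python
import math
from itertools import permutations

def permutation_sign(perm):
    """Return +1 for an even permutation and -1 for an odd one (inversion count)."""
    sign = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                sign = -sign
    return sign

def hartree_product(orbitals, coords):
    """Simple product phi_1(x_1) * phi_2(x_2) * ... of one-electron functions."""
    value = 1.0
    for phi, x in zip(orbitals, coords):
        value *= phi(x)
    return value

def slater_determinant(orbitals, coords):
    """Antisymmetrized sum over all N! permutations, normalized by 1/sqrt(N!)."""
    n = len(orbitals)
    total = 0.0
    for perm in permutations(range(n)):
        term = 1.0
        for k, p in enumerate(perm):
            term *= orbitals[k](coords[p])
        total += permutation_sign(perm) * term
    return total / math.sqrt(math.factorial(n))

# Hypothetical one-dimensional orbitals, for illustration only.
orbitals = [
    lambda x: math.exp(-x * x),
    lambda x: x * math.exp(-x * x),
    lambda x: (2 * x * x - 1) * math.exp(-x * x),
]
coords = [0.3, -0.7, 1.1]
swapped = [-0.7, 0.3, 1.1]   # electrons 1 and 2 exchanged

psi = slater_determinant(orbitals, coords)
psi_swapped = slater_determinant(orbitals, swapped)
print(psi, psi_swapped)      # same magnitude, opposite sign

product = hartree_product(orbitals, coords)
product_swapped = hartree_product(orbitals, swapped)
print(product, product_swapped)   # no simple sign relation

# Number of terms for the 140-electron silicon cluster discussed in the text.
order_of_magnitude = len(str(math.factorial(140))) - 1
print(order_of_magnitude)    # 241
```

The explicit sum over permutations is used here only because it mirrors the N! terms counted in the text; it is not a practical way to evaluate a determinant.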







This brute-force analytical approach leads to dramatically increasing 
complexity, and it would be better to look for an alternate route to the principal 
objective which is to determine the ground-state energy of the system. In order 
to derive the energy eigenstates of the Schrodinger equation, it is necessary to 
have a set of wavefunctions. In order to determine the correct ground-state 
energy, it is necessary to use the correct wavefunctions. Finding these 
wavefunctions is an iterative process. In many cases, we are not even interested 
in the wavefunctions of the electrons. These are a means to an end. As in the 
case of the pseudopotential method, we should develop a different approach by 
searching for a variable that reflects the behavior of electrons in the ground- 
state. 

In the face of this situation, Hohenberg and Kohn proposed in 1964 such an 
alternative analysis. They proposed and proved that 

• There exists a variable which is a function of the electron charge density 
distribution. 

• The energy of the multi-electron system depends on this variable. 

• The minimum of this energy is the true ground state energy of the system. 

Their proof, which we reproduce below, is exemplary for both its elegance 
and clarity. 

4.9.1 The Hohenberg-Kohn Theorem and Density 
Functional Theory 

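There is a universal functional $F[\rho(\mathbf{r})]$ of the electron charge density distribution 
$\rho(\mathbf{r})$ that defines the total energy of the electron distribution by 

$$ E = \int v(\mathbf{r})\,\rho(\mathbf{r})\,d\mathbf{r} + F[\rho(\mathbf{r})] \qquad (4.33) $$

The total energy $E$ has a minimum when the charge density $\rho(\mathbf{r})$ coincides 
with the true charge density distribution in the potential $v(\mathbf{r})$. 

The electronic charge density distribution $\rho(\mathbf{r})$ is related to the true N-body 
wavefunction $\Psi(\mathbf{r}_1, \mathbf{r}_2 \ldots \mathbf{r}_N)$ via 

$$ \rho(\mathbf{r}) = N\int \Psi^*(\mathbf{r}, \mathbf{r}_2 \ldots \mathbf{r}_N)\,\Psi(\mathbf{r}, \mathbf{r}_2 \ldots \mathbf{r}_N)\,d\mathbf{r}_2 \ldots d\mathbf{r}_N \qquad (4.34) $$

Then the expectation value for the one-body potential $v(\mathbf{r})$ is 

$$ \langle\Psi|V|\Psi\rangle = \int \rho(\mathbf{r})\,v(\mathbf{r})\,d\mathbf{r} = \sum_{i=1}^{N}\langle\Psi|v(\mathbf{r}_i)|\Psi\rangle \qquad (4.35) $$

The true wavefunction is a unique functional of the external potential 

$$ \Psi(\mathbf{r}_1, \mathbf{r}_2 \ldots \mathbf{r}_N) = \Psi[v(\mathbf{r})] \qquad (4.36) $$

The charge density is a unique functional of the external potential 

$$ \rho(\mathbf{r}) = \rho[v(\mathbf{r})] \qquad (4.37) $$

On the other hand, is it true that the potential $v(\mathbf{r})$ is a unique functional of 
the charge density distribution? 

To answer this question let us suppose that a different potential $v'(\mathbf{r})$ gives 
the same ground-state charge density distribution $\rho(\mathbf{r})$. 

Let the Hamiltonians whose potentials are given by $v(\mathbf{r})$ and $v'(\mathbf{r})$ be given 
by $\mathcal{H}$ and $\mathcal{H}'$ respectively. Similarly, let us write their eigenvalues and 
eigenfunctions as $E$ and $E'$ and $\Psi$ and $\Psi'$. 

We have 

$$ \mathcal{H}\Psi = E\Psi \quad \text{and} \quad \mathcal{H}'\Psi' = E'\Psi' \qquad (4.38) $$

From the variational principle of the ground state, we expect 

$$ E' = \langle\Psi'|\mathcal{H}'|\Psi'\rangle < \langle\Psi|\mathcal{H}'|\Psi\rangle = \langle\Psi|\mathcal{H} + V' - V|\Psi\rangle \qquad (4.39) $$

$$ E' < \langle\Psi|\mathcal{H}|\Psi\rangle + \int [v'(\mathbf{r}) - v(\mathbf{r})]\,\rho(\mathbf{r})\,d\mathbf{r} = E + \int [v'(\mathbf{r}) - v(\mathbf{r})]\,\rho(\mathbf{r})\,d\mathbf{r} \qquad (4.40) $$

And simultaneously, 

$$ E = \langle\Psi|\mathcal{H}|\Psi\rangle < \langle\Psi'|\mathcal{H}|\Psi'\rangle $$

$$ E < \langle\Psi'|\mathcal{H}'|\Psi'\rangle + \int [v(\mathbf{r}) - v'(\mathbf{r})]\,\rho(\mathbf{r})\,d\mathbf{r} = E' - \int [v'(\mathbf{r}) - v(\mathbf{r})]\,\rho(\mathbf{r})\,d\mathbf{r} \qquad (4.41) $$

Adding these two inequalities, we must draw the self-contradictory result: 

$$ E' + E < E + E' \qquad (4.42) $$

So we must abandon the original presumption. Thus $v(\mathbf{r})$ is a unique 
functional of $\rho(\mathbf{r})$. Since $v(\mathbf{r})$ determines the form of the Hamiltonian $\mathcal{H}$ 
uniquely, the wavefunction $\Psi$ in the ground state is a unique functional of 
$\rho(\mathbf{r})$. 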



It follows that we can write the ground-state energy as a unique functional of 
the charge distribution, which will depend on the wavefunction and the 
Hamiltonian $\mathcal{H} = T + U$, where $T$ represents kinetic energy and $U$ represents the 
interaction energy between electrons. 

$$ F[\rho(\mathbf{r})] = \langle\Psi|T|\Psi\rangle + \langle\Psi|U|\Psi\rangle \qquad (4.43) $$

The energy of the system of electrons is expressed as a functional of the 
charge density 

$$ E_v[\rho(\mathbf{r})] = \int v(\mathbf{r})\,\rho(\mathbf{r})\,d\mathbf{r} + F[\rho(\mathbf{r})] \qquad (4.44) $$

This functional can be minimized by an appropriate iteration algorithm, 
yielding the ground state energy. In a few more lines, Hohenberg and Kohn show 
that the electron density $\rho(\mathbf{r})$ that minimizes (4.44) is the true electron density. 

This result is an intellectual breakthrough that has dramatically changed the 
course of chemistry and physics. While the brute-force approach, which 
determines a wavefunction for each electron and accounts for the interactions 
between all the electrons, would produce a correct result for the energies of the 
systems of particles, this method cannot be applied to real systems because it is 
too time-consuming. 

Hohenberg and Kohn showed that there is another route to the answer. The 
charge density distribution depends on 3 spatial variables (4, if spin is taken into 
account), no matter how many electrons are involved. The ground state energy 
can be determined by minimizing a functional that depends only on this charge 
density. We remember that the actual particle cluster or molecule under 
consideration shows us yet another way to determine the ground-state energy in 
less than a picosecond without recourse to the brute-force procedure either. 
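
To make the difference in scale concrete, the following sketch counts how many numbers would be needed to tabulate a many-electron wavefunction and a charge density on the same spatial grid. The grid of 10 points per axis is an arbitrary assumption for illustration, and spin is ignored.

```python
def wavefunction_grid_values(points_per_axis, n_electrons):
    """Values needed to tabulate Psi(r_1 ... r_N): one axis per coordinate, 3N axes."""
    return points_per_axis ** (3 * n_electrons)

def density_grid_values(points_per_axis):
    """Values needed to tabulate rho(r): always 3 axes, whatever N is."""
    return points_per_axis ** 3

points = 10          # hypothetical, very coarse grid
electrons = 140      # the 10-atom silicon cluster used as an example in the text

n_psi = wavefunction_grid_values(points, electrons)
n_rho = density_grid_values(points)
print(len(str(n_psi)) - 1)   # exponent of ten for the wavefunction table
print(n_rho)                 # size of the density table
```

Even on this coarse grid the wavefunction table grows exponentially with the number of electrons, while the density table does not grow at all.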

4.9.2 The Kohn-Sham Equations 

The paper by Hohenberg and Kohn showed that there is an alternate route to 
determination of the ground-state energy and the importance of the charge 
density, but it did not show how to derive the charge density distribution that is 
necessary to make the approach practical. A procedure was given in the 
following year by Kohn and Sham (1965). These two seminal papers form the 
basis of density functional theory. 

Kohn and Sham confirmed the importance of decoupling the interactions 
between electrons that we evoked in (4.29). However, instead of substituting an 
average interaction potential, they introduced the charge density function $\rho(\mathbf{r})$, 
and wrote the interaction of an electron with this charge density, denoted here $V_H(\mathbf{r})$, as: 

$$ V_H(\mathbf{r}) = \frac{e^2}{4\pi\epsilon_0}\int \frac{\rho(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\,d\mathbf{r}' \qquad (4.45) $$

That is, the electron-electron interaction is replaced by the interaction 
between an electron and the charge density function. With this substitution, the 
Schrodinger equation can be separated into a system of independent and linear 
one-electron equations. These electrons are not “real” electrons, but rather 
Kohn-Sham non-interacting electrons. A simple Hamiltonian for each electron can be 
written down as: 

$$ H_{K\text{-}S} = T(\mathbf{r}) + V(\mathbf{r}) + V_H(\mathbf{r}) + V_{\text{exch}}(\mathbf{r}) \qquad (4.46) $$

where 

$T(\mathbf{r})$ is the kinetic energy 
$V(\mathbf{r})$ is the Coulomb potential of the nuclei 
$V_H(\mathbf{r})$ is the interaction with the charge density given in (4.45) 

and $V_{\text{exch}}(\mathbf{r})$ is an electron exchange-energy term that corrects for the 
self-interaction of an electron included in the expression of (4.45), as well as 
correlation effects that need to be taken into account because they are ignored by 
the non-interacting assumption. This term is the subject of numerous studies and 
is often represented by an empirical model appropriate to the system under study. 
The Kohn-Sham equations, one for each electron, can be written down: 

$$ \left[-\frac{\hbar^2}{2m}\nabla^2 + V(\mathbf{r}) + V_H(\mathbf{r}) + V_{\text{exch}}(\mathbf{r})\right]\phi_i(\mathbf{r}) = \varepsilon_i\,\phi_i(\mathbf{r}) \qquad (4.47) $$

This set of linear uncoupled equations can be set up in matrix form and 
solved for the eigenvalues, and the set of fictitious non-interacting 
wavefunctions can be determined. The charge density is determined from the 
summation: 

$$ \rho(\mathbf{r}) = \sum_{i=1}^{N} |\phi_i(\mathbf{r})|^2 $$


Of course, the fictitious wavefunctions do not represent an approximation to 
the real electron wavefunctions, and the eigenvalues determined by this process 
do not correspond to the real energy states either. What Kohn and Sham proved 
is that the electron density determined in this process, by using a recursive 
calculation, can be made to converge to the real electron density (LeSar 2013). 
This is the key result that is needed to exploit the Hohenberg-Kohn theorem. An 
iteration procedure is shown in Fig. 4.10. 







Fig. 4.10 Flow diagram for a recursive solution of the Kohn-Sham equations 
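
The recursive procedure can be sketched on a deliberately small toy problem. Everything specific in the example below is an assumption made for illustration and is not taken from Kohn and Sham or from any DFT package: two electrons share one orbital in one dimension, the external potential is harmonic, the electrons repel through a softened Coulomb interaction, the exchange-correlation term is simply set to zero, atomic units are used, and the function names are hypothetical. The loop itself follows the usual pattern: guess a density, build the effective potential, solve the one-electron equation, rebuild the density, mix, and repeat until the density stops changing.

```python
import math

def lowest_state(v, h, n_iter=100):
    """Lowest eigenpair of -1/2 d2/dx2 + v(x) on a uniform grid, by inverse iteration."""
    n = len(v)
    off = -0.5 / h ** 2               # off-diagonal of the finite-difference Hamiltonian
    sigma = min(v) - 1.0              # shift below the spectrum: H - sigma*I is positive definite
    diag = [1.0 / h ** 2 + vj - sigma for vj in v]
    x = [1.0] * n
    for _ in range(n_iter):
        # Solve (H - sigma*I) y = x with the Thomas algorithm for tridiagonal systems.
        c = [0.0] * n
        d = [0.0] * n
        c[0] = off / diag[0]
        d[0] = x[0] / diag[0]
        for j in range(1, n):
            denom = diag[j] - off * c[j - 1]
            c[j] = off / denom
            d[j] = (x[j] - off * d[j - 1]) / denom
        y = [0.0] * n
        y[-1] = d[-1]
        for j in range(n - 2, -1, -1):
            y[j] = d[j] - c[j] * y[j + 1]
        norm = math.sqrt(sum(yj * yj for yj in y) * h)
        x = [yj / norm for yj in y]   # normalized so that sum(x^2) * h = 1
    # The Rayleigh quotient of the converged vector gives the eigenvalue.
    energy = 0.0
    for j in range(n):
        left = x[j - 1] if j > 0 else 0.0
        right = x[j + 1] if j < n - 1 else 0.0
        energy += x[j] * (off * (left + right) + (1.0 / h ** 2 + v[j]) * x[j]) * h
    return energy, x

def hartree_potential(rho, xs, h, soft=1.0):
    """Interaction of one electron with the density, using a softened Coulomb kernel."""
    return [sum(rk * h / math.sqrt((xj - xk) ** 2 + soft ** 2)
                for rk, xk in zip(rho, xs)) for xj in xs]

def self_consistent_density(n=81, half_width=6.0, mix=0.5, tol=1e-8, max_iter=100):
    h = 2.0 * half_width / (n - 1)
    xs = [-half_width + j * h for j in range(n)]
    v_ext = [0.5 * x * x for x in xs]            # hypothetical external potential
    _, phi = lowest_state(v_ext, h)              # initial guess: non-interacting density
    rho = [2.0 * p * p for p in phi]
    eps, converged, steps = 0.0, False, 0
    for steps in range(1, max_iter + 1):
        v_h = hartree_potential(rho, xs, h)
        v_eff = [a + b for a, b in zip(v_ext, v_h)]   # exchange-correlation omitted here
        eps, phi = lowest_state(v_eff, h)
        rho_new = [2.0 * p * p for p in phi]          # two electrons in the same orbital
        change = max(abs(a - b) for a, b in zip(rho_new, rho))
        rho = [(1.0 - mix) * a + mix * b for a, b in zip(rho, rho_new)]
        if change < tol:
            converged = True
            break
    return rho, eps, steps, converged, h

rho, eps, steps, converged, h = self_consistent_density()
print(steps, converged, eps, sum(rho) * h)
```

Linear mixing of the old and new densities is one simple way to stabilize the iteration; production codes use more elaborate mixing schemes and basis sets.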

4.9.3 Using Density Functional Theory to Analyze Real 
Systems 

Realistic modeling and simulation of real atomic and molecular systems involves 
significant computational resources, even with the benefit of density functional 
theory. There are a number of centers which have developed computational 
infrastructures for such studies, some of which can be used by interested 
scientists. Sholl and Steckel (2009) have written an excellent introduction to 
density functional calculations for the prospective user that makes use of 
available computational platforms. These platforms provide an entry to 
density-functional analysis without having to reinvent the software routines needed for 
implementation, thus saving years of effort to develop and debug the 
computational routines. Two such initiatives are mentioned here: VASP and 
Quantum ESPRESSO. Additional software routines can be found at 
SourceForge.net: https://sourceforge.net/directory/os:windows/?q=density%20functional%20theory. 

The Vienna Ab initio Simulation Package (VASP) is a computer program for 
atomic scale materials modelling, e.g. electronic structure calculations and 
quantum-mechanical molecular dynamics, from first principles: 
https://en.wikipedia.org/wiki/Vienna_Ab_initio_Simulation_Package. 

VASP computes an approximate solution to the many-body Schrodinger 
equation, either within density functional theory (DFT), solving the Kohn-Sham 
equations, or within the Hartree-Fock (HF) approximation, solving the Roothaan 
equations. Hybrid functionals that mix the Hartree-Fock approach with density 
functional theory are implemented as well. Access to VASP tools and 
capabilities is by paid subscription and comes with significant support from 
VASP. 

Quantum ESPRESSO is an integrated suite of computer codes for electronic- 
structure calculations and materials modeling, based on density-functional 
theory, plane waves, and pseudopotentials (norm-conserving, ultrasoft, and 
projector-augmented wave), http://www.quantum-espresso.org/. 

Quantum ESPRESSO stands for “opEn Source Package for Research in 
Electronic Structure, Simulation, and Optimization”. It is available at no cost to 
researchers around the world under the terms of the GNU General Public 
License. Quantum ESPRESSO builds upon electronic-structure codes that have 
been developed and tested by some of the original authors of novel electronic- 
structure algorithms and applied in the last twenty years by some of the leading 
materials modeling groups worldwide. As an open-source project, there is no 
paid support service. There are a number of public user-groups. Interaction with 
these groups may provide a level of support for understanding and using the 
software. 


4.10 Summary 

The electronic bandstructure of many semiconductors of technological 
importance can be calculated using the pseudopotential method. The power of 
the pseudopotential method resides in its ability to substitute a fictitious potential 
that accounts for the behavior of the electrons in the weakly-bound valence band 
states. In the case of germanium for example, the interactions between the 
electron of interest and the nuclear potential as well as the interactions between 
the 32 electrons that make up the atom are summarized in a potential that can be 
expanded in a Fourier series based on the periodicity of the crystal lattice. Only a 
few terms of this expansion are needed to give an electronic bandstructure that is 
in excellent agreement with experimental measurement. 

A pseudopotential can be calculated from first principles if the ground-state 
configuration of the material can be determined. Hohenberg and Kohn proved 
that the ground state configuration of an ensemble of atoms, such as a crystal or 
a molecule, is a functional of the correct electron charge density: that is, a 
function of three spatial dimensions. Density functional theory has enabled the 
determination of the ground-state configuration not only of semiconductors 
whose structure is relatively simple, but more importantly of complex molecules 
and proteins with relevance to biochemistry. In 1998 Walter Kohn received the 
Nobel Prize in Chemistry for the development of density functional theory. 


Exercises 

4.1. Prove that the energy eigenvalues of $\Psi(\mathbf{k}, \mathbf{r})$ are identical to those of 
$\Psi(\mathbf{k} + \mathbf{G}, \mathbf{r})$. 

4.2. Graphene is a material of great technological promise. It was 
studied as a model system for many years before Andre Geim 
and Konstantin Novoselov showed how to make real samples. Their 
discovery was recognised by the 2010 Nobel Prize in Physics. 

A diagram of the graphene structure is shown in Fig. 4.11. Note 
that the unit cell contains two carbon atoms. 
Fig. 4.11 Crystal structure of graphene in the x-y plane. Graphene is a 2-dimensional 
structure in a 3-dimensional world. Thus, the 3-dimensional structure is completed by 
supposing parallel sheets of graphene separated in the 3rd dimension by z = c 


4.2a. Using the primitive vectors $\mathbf{a}_1$, $\mathbf{a}_2$ and $\mathbf{a}_3$, find the three basis 
vectors that span the reciprocal lattice. Show that the resulting 
structure is independent of the spacing between parallel sheets, and 
becomes 2-dimensional in the limit of very large c. In the 
2-dimensional limit, make a diagram of the points in reciprocal space 
obtained by adding and subtracting the two basis vectors. 

4.2b. Make a graph of the Brillouin zone for graphene, and label the high- 
symmetry corners and faces. Include the coordinates of these points. 

4.3. Using the results in Exercise 4.2, make a calculation of the free- 
electron bandstructure for graphene. Compare your calculation with 
the results of Fischetti et al. (2013). 

4.4. Calculate the energy band structures for Ge, using MATLAB, and 
incrementing the periodic potential from 0 to 1 (full potential), as 
shown in Sect. 4.7. Integrate your results into a video animation 
using the moviemaker function of MATLAB. 

4.5. The pseudopotential method is based on the assumption of the 
separation of electrons into two categories: valence states and core 
states. The core states are contained in a complete and closed shell. 
The assumption is valid for silicon. In the case of diamond the core 
states are the 1s and 2s electrons. The valence states are the 2p 
electrons. In this case there is less complete screening of the nuclear 
potential. 

Give an evaluation of the limitations of the pseudopotential 
method using diamond as an example. 

4.5a. Perform a calculation of the bandstructure of diamond using the 
pseudopotential form factors proposed by Saslow et al. (1966) 

$V_{111} = -11.0296$ 
$V_{220} = 4.58$ 
$V_{311} = 1.7952$ 
$V_{222} = 0.5576$ 

In order to accommodate the $V_{222}$ form factor, you will have to 
modify the Matlab code. 

4.5b. Compare your result to more recent calculations of the diamond 
bandstructure, for example Calzaferri and Rytz (1996). What 
adjustments could be made to the pseudopotential form factors to 
give better agreement with the direct and indirect bandgaps? 

4.5c. The $V_{222}$ form factor should be identically zero by symmetry. 

Criticize the rationale of Saslow et al. for proposing a non-zero 
value for this form factor. Does this feature improve the overall 
result in a significant way? 


Appendix: Source Code for Bandstructure Calculation 
Using MATLAB 



% This program calculates the band structure of an F-C-C material using
% pseudopotentials to model the potential due to atoms at lattice sites in the
% material. The program plots E vs k for four k-directions with calculations
% through the {311} family of reciprocal lattice vectors.

% This is a Matlab program. In order to run it you must save the file with
% a .m extension. After loading Matlab, type the filename without .m and it
% will run.
% Example:
% >> filename

% Pre-defined variables.

clear;
Gmax=11;                  %Maximum of desired |G|^2

% Symmetric and antisymmetric Fourier coefficients of pseudopotentials
% as examples

blank = ' ';
intro1 = 'Please enter the symmetric and anti-symmetric potential';
intro2 = 'coefficients corresponding to the proper values of |G|^2';
intro3 = 'in eV. The prompts will have the form "V-sym(3) ="';

disp('                   Pseudopotential Coefficients in eV');
disp('Material  V-sym(3)  V-sym(8)  V-sym(11)  V-asym(3)  V-asym(4)  V-asym(11)');
disp('  Si      -2.856    0.544     1.088      0          0          0');
disp('  Ge      -3.128    0.136     0.816      0          0          0');
disp('  Sn      -2.720    0         0.544      0          0          0');
disp(' ');
disp('  GaP     -2.992    0.408     0.952      1.632      0.952      0.272');
disp('  GaAs    -3.128    0.136     0.816      0.952      0.680      0.136');
disp('  AlSb    -2.856    0.272     0.816      0.816      0.544      0.272');
disp('  InP     -3.128    0.136     0.816      0.952      0.680      0.136');
disp('  GaSb    -2.992    0.0       0.680      0.816      0.680      0.136');
disp('  InAs    -2.992    0.0       0.680      1.088      0.680      0.408');
disp('  InSb    -2.720    0.0       0.544      0.816      0.680      0.136');
disp(' ');
disp('  ZnS     -2.992    0.408     0.952      3.264      1.904      0.544');
disp('  ZnSe    -3.128    0.136     0.816      2.448      1.632      0.408');
disp('  ZnTe    -2.992    0.0       0.680      1.768      1.360      0.136');
disp('  CdTe    -2.720    0.0       0.544      2.040      1.224      0.544');
disp(' ');
disp('--------------------------------------------------------------------------');
disp(blank);
disp(blank);
disp(intro1);
disp(intro2);
disp(intro3);

% V-sym(4) and V-antisym(8) are not tabulated above. Their structure factors
% (cos(L) and sin(L) below) vanish, so 0 can be entered for them.
v1 = input('V-sym(3) = ');
v2 = input('V-sym(4) = ');
v3 = input('V-sym(8) = ');
v4 = input('V-sym(11) = ');
v5 = input('V-antisym(3) = ');
v6 = input('V-antisym(4) = ');
v7 = input('V-antisym(8) = ');
v8 = input('V-antisym(11) = ');

C = input('Enter the potential scaling factor (0=no potential, 1=full potential): ');
% Scaling factor for potentials.
% C=0 is free electron, C=1 is full potential.

S_pot_3 =C*(v1);          %G = (111);
A_pot_3 =C*(v5);
S_pot_4 =C*(v2);          %G = (200);
A_pot_4 =C*(v6);
S_pot_8 =C*(v3);          %G = (220);
A_pot_8 =C*(v7);
S_pot_11=C*(v4);          %G = (311);
A_pot_11=C*(v8);

lattice= input('lattice constant in angstroms ');
lattice=lattice*1e-10;    %Lattice constant in m.
sample = 40;              %Sampling points per leg.
leg_num=5;                %Plotting leg number.
Emax =17;                 %Plotting energy maximum (not used by the axis call below).
Emin=-4;                  %Plotting energy minimum (not used by the axis call below).

BZ= [1/2 1/2 1/2; 0 0 0; 1 0 0; 1 1/4 1/4; 0 3/4 3/4; 0 0 0];   %Define zone boundary.
T= (1.054e-34*2*pi/lattice)^2/(2*9.1e-31*1.6e-19);   %hbar^2 (2 pi/a)^2 / 2m, in eV
i=sqrt(-1);

%Start calculating
%Find desired G's

count=1;
for n1=-3:1:3;
    for n2=-3:1:3;
        for n3=-3:1:3;
            x=n1-n2+n3;
            y=n1+n2-n3;
            z=-n1+n2+n3;
            if (x^2+y^2+z^2)<=Gmax;
                G(count,1)=x;
                G(count,2)=y;
                G(count,3)=z;
                count=count+1;
            end;
        end;
    end;
end;

vec_num=count-1;          %Total desired G number.
M(vec_num,vec_num)=0;     %Reset matrix

%Set up off diagonal elements.
for column=1:1:vec_num;
    for row=column+1:1:vec_num;
        x=G(column,1)-G(row,1);
        y=G(column,2)-G(row,2);
        z=G(column,3)-G(row,3);
        potential=0;
        potential2=0;
        leng2=x^2+y^2+z^2;
        L=pi/4*(x+y+z);
        if leng2==3,
            potential=S_pot_3*cos(L)+i*A_pot_3*sin(L);
            potential2=S_pot_3*cos(-L)+i*A_pot_3*sin(-L); end;
        if leng2==4,
            potential=S_pot_4*cos(L)+i*A_pot_4*sin(L);
            potential2=S_pot_4*cos(-L)+i*A_pot_4*sin(-L); end;
        if leng2==8,
            potential=S_pot_8*cos(L)+i*A_pot_8*sin(L);
            potential2=S_pot_8*cos(-L)+i*A_pot_8*sin(-L); end;
        if leng2==11,
            potential=S_pot_11*cos(L)+i*A_pot_11*sin(L);
            potential2=S_pot_11*cos(-L)+i*A_pot_11*sin(-L); end;
        M(column,row)=potential;
        M(row,column)=potential2;
    end;
end;

%Eigenvalue calculation routine.

pointa=0;
ks = sqrt(3/4);           %Scaling factor for k values.
sb = 0;
interval = 0;

for leg=1:1:leg_num;
    if leg~=4,
        x_interval=(BZ(leg+1,1)-BZ(leg,1))/sample;
        y_interval=(BZ(leg+1,2)-BZ(leg,2))/sample;
        z_interval=(BZ(leg+1,3)-BZ(leg,3))/sample;
        if leg==2, sb = k(leg-1,pointa+1); interval = 0; ks = 1.0; end;
        if leg==3, ks = sqrt(1/8); sb = k(leg-1,pointa+1); interval = 0; end;
        if leg==5, ks = sqrt(9/8); sb = k(leg-2,pointa+1); interval = 0; end;
        pointa = 0;
        for n=0:1:sample;
            x=BZ(leg,1)+n*x_interval;
            y=BZ(leg,2)+n*y_interval;
            z=BZ(leg,3)+n*z_interval;
            for m=1:1:vec_num;          %Set up diagonal elements
                M(m,m)=T*((G(m,1)+x)^2+(G(m,2)+y)^2+(G(m,3)+z)^2);
            end;
            Eg1=eig(M);
            Eg=sort(Eg1);
            for m=1:1:vec_num;
                if leg==1, p1(pointa+1,m)=Eg(m); end;
                if leg==2, p2(pointa+1,m)=Eg(m); end;
                if leg==3, p3(pointa+1,m)=Eg(m); end;
                if leg==5, p4(pointa+1,m)=Eg(m); end;
            end;
            k(leg,pointa+1)=sb+(ks*interval/sample);
            pointa=pointa+1;
            interval=interval+1;
        end;
        pointa=pointa-1;
    end;
end;

%Plotting routine.

plot(k(1,:),p1,'k',k(2,:),p2,'k',k(3,:),p3,'k',k(5,:),p4,'k');
axis([0, 3.3, -3, 30]);
set(gca,'FontName','Symbol');
xlabel('L               G                  C     K                   G');
set(gca,'FontName','Helvetica');
ylabel('E (eV)');
title('Band Structure');
Abstract 

Maxwell’s equations give a description of electricity and magnetism in terms of 
the electric and magnetic fields, but say very little about the quanta: electrons 
and photons, that constitute the fields. This field description treats electric and 
magnetic phenomena like continuum fluids. By quantization, we can treat 
electromagnetic phenomena from the perspective of the quanta. This is the basis 
of quantum photonics. We show that photon behavior can be described 
mathematically by the formalism of the harmonic oscillator. This important 
principle enables the description of single-photon (that is, quantum) 
behavior and the quantization of the electromagnetic field. Creation or 
annihilation of photons is the way that the electromagnetic field exchanges 
energy with matter. The creation and annihilation operators can be used to 
describe the emission and absorption of photons in a direct and transparent way, 
without having to derive the quantum wavefunction in explicit form. We use 
these operators to analyze stimulated and spontaneous emission (and absorption) 
of photons, and thus the performance of lasers, LEDs and photodiodes. 


5.1 Introduction 

The time-independent Schrodinger equation for a particle in the presence of a 
one-dimensional potential is written: 

$$ -\frac{\hbar^2}{2m}\frac{d^2\Psi(x)}{dx^2} + V(x)\,\Psi(x) = E\,\Psi(x) \qquad (5.1) $$

The total energy $E$ is shared between kinetic ($p^2/2m$) and potential ($V(x)$) 
energies. In Chap. 3, we examined the properties of a particle confined by a 
constant potential (particle in a well). In this environment, the allowed quantized 
total energy scales according to the square of the quantum number: 

$$ E_n = \frac{n^2\pi^2\hbar^2}{2mL^2} \qquad (5.2) $$

where $L$ is the width of the potential well. The dependence of the allowed 
energies on the quantum number is a function of the geometry of the potential. 
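
As a numerical illustration of the quadratic scaling, the sketch below evaluates the levels of an electron in a well, assuming that (5.2) has the standard infinite-well form $E_n = n^2\pi^2\hbar^2/(2mL^2)$. The 1 nm width is an arbitrary choice for the example.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
M_E = 9.1093837015e-31      # electron rest mass, kg
EV = 1.602176634e-19        # joules per electron-volt

def well_energy_ev(n, width_m, mass=M_E):
    """Energy of level n (n = 1, 2, ...) of an infinitely deep well, in eV."""
    return (n * math.pi * HBAR) ** 2 / (2.0 * mass * width_m ** 2) / EV

width = 1e-9                # hypothetical 1 nm well
levels = [well_energy_ev(n, width) for n in (1, 2, 3)]
ratios = [e / levels[0] for e in levels]
print(levels)               # energies in eV
print(ratios)               # scale as 1, 4, 9
```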

In this chapter we will study the behavior of a particle confined by a 
potential well that is not constant but varies as a function of distance: the 
harmonic oscillator potential. 

A simple example of a harmonic oscillator is the movement of a mass 
connected to a spring. When displaced from equilibrium the spring exerts a 
restoring force that is proportional to the displacement: $F = -Kx$. The 
corresponding potential energy stored in the spring is $V(x) = \frac{1}{2}Kx^2$. When 
released, the mass begins to move, and energy oscillates between reservoirs of 
kinetic and potential contributions with a frequency $\omega = \sqrt{K/m}$. 

The vibration of atoms in a crystalline solid is an important example in 
physics which can be treated accurately by harmonic oscillator analysis. 

Maxwell’s equations describe the propagation of a photon as the periodic 
exchange of energy stored in its electric and magnetic field. We will argue in 
Sect. 5.4 that photon behavior can be described mathematically by the formalism 
of the harmonic oscillator. This important principle enables the description of 
single-photon (that is, quantum) behavior and the quantization of the 
electromagnetic field. This is the subject of this chapter. 


5.2 The Harmonic Oscillator 

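The Hamiltonian operator for the total energy of a simple one-dimensional 
harmonic oscillator is: 

$$ \mathcal{H} = \frac{p^2}{2m} + \frac{Kx^2}{2} \qquad (5.3) $$

The time-independent Schrodinger equation is 

$$ -\frac{\hbar^2}{2m}\frac{d^2\Psi}{dx^2} + \frac{Kx^2}{2}\,\Psi = E\,\Psi \qquad (5.4) $$

For the case of a quantum harmonic oscillator (such as an electron confined 
to a quantum well having a parabolic potential), we can write: 

$$ K = m\omega^2, \qquad \alpha^2 = \frac{m\omega}{\hbar} \qquad (5.5) $$

$$ \frac{d^2\Psi}{dx^2} + \left(\frac{2mE}{\hbar^2} - \alpha^4x^2\right)\Psi = 0 \qquad (5.6) $$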

The solution to this equation is straightforward if somewhat labor-intensive. 
For the details, we refer the reader to any standard text on quantum mechanics, 
several of which are given in the references to this chapter (Dirac 1930; Landau 
and Lifshitz 1958; Merzbacher 1960; Yariv 1975). 

The elements are as follows: The eigenfunctions are composed of the 
product of a Gaussian function that declines to zero at $x = \pm\infty$, permitting 
normalization of the wavefunction, and an oscillating polynomial part inside the 
potential well where the particle is confined. 

$$ \Psi(x) = C\,e^{-\alpha^2x^2/2}\,P(x), \quad \text{where } C \text{ is the constant of normalization.} \qquad (5.7) $$

$P(x)$ is a polynomial of order $n$ for the $n$th energy level. 
These polynomials are called Hermite polynomials, $H_n(\alpha x)$. The first few 
Hermite polynomials are: 

$$ H_0(\alpha x) = 1, \qquad H_1(\alpha x) = 2\alpha x, \qquad H_2(\alpha x) = 4(\alpha x)^2 - 2 \qquad (5.8) $$
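
Higher-order Hermite polynomials can be generated from the standard recurrence $H_{n+1}(\xi) = 2\xi H_n(\xi) - 2nH_{n-1}(\xi)$, with $\xi = \alpha x$. The sketch below is one way to do this; the function name and the coefficient-list representation are choices made for this example.

```python
def hermite_coefficients(n):
    """Coefficients of the physicists' Hermite polynomial H_n, lowest power first."""
    previous, current = [1], [0, 2]        # H_0 = 1, H_1 = 2*xi
    if n == 0:
        return previous
    for k in range(1, n):
        # H_{k+1} = 2*xi*H_k - 2*k*H_{k-1}
        following = [0] + [2 * c for c in current]
        for power, c in enumerate(previous):
            following[power] -= 2 * k * c
        previous, current = current, following
    return current

def evaluate(coefficients, xi):
    """Evaluate a polynomial given by its coefficients at the point xi."""
    return sum(c * xi ** power for power, c in enumerate(coefficients))

for order in range(4):
    print(order, hermite_coefficients(order))
```

The first three results reproduce (5.8), and the fourth gives $H_3(\xi) = 8\xi^3 - 12\xi$.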

However, as is often the case, we are less interested in the wavefunction than 
in the quantized energies of the particle. Using Schrodinger’s 
equation, we can solve for the energies: 

$$ E_n = \hbar\omega\left(n + \tfrac{1}{2}\right) \qquad (5.9) $$

The stable energies or modes of the quantum harmonic oscillator are equally 
spaced in energy: $E_n - E_{n-1} = \hbar\omega$. This is a consequence of the quadratic 
dependence of the potential on distance. The energy level quantum number $n$ 
can be any non-negative integer (zero included). In the case of lattice vibrations 
(phonons) and photons, the quantum number $n$ also gives the number of quanta 
present at a particular frequency $\omega$. Note that there is an energy of $\hbar\omega/2$ 
associated with the vacuum state where no quantum is present. The proof of this 
result is the subject of Exercise 5.1. This is a manifestation of the Heisenberg 
uncertainty principle which states that the product of the minimum uncertainties 
is 

$$ \Delta p\,\Delta x = \frac{\hbar}{2} \qquad (5.10) $$

The Heisenberg uncertainty principle also teaches that the result of the 
measurement of position, followed by measurement of momentum, is not the 
same as that of measuring first the momentum and then the position. That is, the 
position and momentum operations do not commute. The commutator of $p$ and 
$x$ is the mathematical expression of this result: 

$$ [p, x]\Psi = p(x\Psi) - x(p\Psi) = -i\hbar\frac{d}{dx}(x\Psi) - x\left(-i\hbar\frac{d\Psi}{dx}\right) = -i\hbar\left(\Psi + x\frac{d\Psi}{dx}\right) + i\hbar x\frac{d\Psi}{dx} = -i\hbar\Psi $$

Therefore we write: 

$$ [p, x] = -i\hbar \qquad (5.11) $$
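
The commutator can be verified by letting both orderings act on a test function. The sketch below uses a polynomial test function stored as a list of coefficients, and sets $\hbar = 1$; both are conveniences assumed for this example.

```python
def times_x(coefficients):
    """Multiply a polynomial (coefficients listed lowest power first) by x."""
    return [0] + list(coefficients)

def derivative(coefficients):
    """Differentiate a polynomial given by its coefficients."""
    return [power * c for power, c in enumerate(coefficients)][1:] or [0]

def momentum(coefficients, hbar=1.0):
    """Apply the momentum operator p = -i*hbar*d/dx."""
    return [-1j * hbar * c for c in derivative(coefficients)]

def commutator_p_x(coefficients, hbar=1.0):
    """Return [p, x] f = p(x f) - x(p f) for a polynomial f."""
    first = momentum(times_x(coefficients), hbar)
    second = times_x(momentum(coefficients, hbar))
    size = max(len(first), len(second))
    first = first + [0] * (size - len(first))
    second = second + [0] * (size - len(second))
    return [u - v for u, v in zip(first, second)]

f = [1, 2, 3]                        # test function f(x) = 1 + 2x + 3x^2
result = commutator_p_x(f)
expected = [-1j * c for c in f]      # -i*hbar*f with hbar = 1
print(result)
print(expected)
```

The two printed lists agree, showing that $[p, x]$ acting on the test function returns $-i\hbar$ times the same function.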


5.3 Raising and Lowering Operators 

We introduce two operators: $a^+$ and $a$. 

$$ a^+ = \frac{\alpha}{\sqrt{2}}\,x - \frac{1}{\alpha\sqrt{2}}\frac{d}{dx} = \frac{\alpha}{\sqrt{2}}\,x - \frac{i}{\alpha\hbar\sqrt{2}}\,p \qquad (5.12) $$

where $p = -i\hbar\,\partial/\partial x$ is the quantum mechanical momentum operator, and 

$$ a = \frac{\alpha}{\sqrt{2}}\,x + \frac{1}{\alpha\sqrt{2}}\frac{d}{dx} = \frac{\alpha}{\sqrt{2}}\,x + \frac{i}{\alpha\hbar\sqrt{2}}\,p \qquad (5.13) $$

Note that $a^+$ and $a$ do not commute. The commutator is 

$$ [a, a^+] = 1 \qquad (5.14) $$

the proof of which is the subject of Exercise 5.2. 

The operators $a^+$ and $a$ have been developed in particular for the 
study of the harmonic oscillator (Dirac 1930). They enable us to extract many 
properties of the eigenfunctions without having to write down the eigenfunctions 
explicitly. Thus, we can study the energy eigenvalues without having to solve for 
the eigenfunctions. 

We will state the principal properties of these two operators. A full derivation 
can be found in the references on quantum mechanics that are cited in this 
chapter. 

1. If $\phi_n$ is an eigenfunction of the harmonic oscillator, then 

$$ a^+\phi_n = \sqrt{n+1}\;\phi_{n+1} \qquad (5.15) $$

That is, the operator $a^+$ transforms a given eigenstate to the next higher 
eigenstate multiplied by the square root of the occupation number of the new state. 

2. If $\phi_n$ is an eigenfunction of the harmonic oscillator, then 

$$ a\,\phi_n = \sqrt{n}\;\phi_{n-1} \qquad (5.16) $$

That is, the operator $a$ transforms a given eigenstate to the next lower 
eigenstate multiplied by the square root of the occupation number of the original 
state. 

The operators $a^+$ and $a$ are referred to as raising and lowering operators for 
obvious reasons. 

Next, we solve (5.12) and (5.13) for $p$ and $x$. 

$$ p = \frac{i\hbar\alpha}{\sqrt{2}}\,(a^+ - a) \qquad (5.17) $$

$$ x = \frac{1}{\alpha\sqrt{2}}\,(a^+ + a) \qquad (5.18) $$

Substituting these expressions for $p$ and $x$ into (5.3) we obtain: 

$$ \mathcal{H} = \frac{\hbar\omega}{2}\,(aa^+ + a^+a) \qquad (5.19) $$

Using the expression for the commutator of (5.14): 

$$ \mathcal{H} = \hbar\omega\left(a^+a + \tfrac{1}{2}\right) \qquad (5.20) $$

Finally, we use (5.15) and (5.16) to show (see Exercise 5.4) 

$$ a^+a\,\phi_n = n\,\phi_n \qquad (5.21) $$

So that we find directly the results of (5.9): 

$$ \mathcal{H}\phi_n = \hbar\omega\left(n + \tfrac{1}{2}\right)\phi_n \qquad (5.22) $$

The operation $a^+a$ gives the number of quanta in the eigenstate $\phi_n$. 
Since the total energy is directly related to the number of quanta, we have two 
important results that characterise the harmonic oscillator without having to 
work explicitly with the eigenstate wavefunction itself. This is an immense 
simplification that enables a clearer view of the physics. Next, we will apply this 
methodology to the characterisation of photons as the quanta that constitute the 
electromagnetic field. 
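
The operator relations of this section can be checked numerically by writing $a$ and $a^+$ as matrices in the basis of number states. The sketch below assumes a basis truncated to six states, which is an approximation made only for this illustration: the true operators are infinite-dimensional, so the last diagonal entry of the commutator is a truncation artifact and is excluded from the check.

```python
import math

def lowering_matrix(dim):
    """Matrix of a in the number basis: <n-1| a |n> = sqrt(n)."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)
    return a

def transpose(matrix):
    return [list(row) for row in zip(*matrix)]

def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]

dim = 6                                   # hypothetical truncation of the basis
a = lowering_matrix(dim)
a_dag = transpose(a)                      # raising operator (the matrix is real)

number = matmul(a_dag, a)                 # a+ a, the number operator
a_a_dag = matmul(a, a_dag)
commutator = [[a_a_dag[i][j] - number[i][j] for j in range(dim)] for i in range(dim)]

# Raising the state n = 2 gives sqrt(3) times the state n = 3.
state_2 = [1.0 if n == 2 else 0.0 for n in range(dim)]
raised = [sum(a_dag[i][j] * state_2[j] for j in range(dim)) for i in range(dim)]

# Energies in units of hbar*omega: H = a+ a + 1/2.
energies = [number[n][n] + 0.5 for n in range(dim)]

print([commutator[n][n] for n in range(dim)])
print(raised)
print(energies)
```

The diagonal of $a^+a$ returns the occupation numbers 0, 1, 2, ..., and the energies follow as $n + \tfrac{1}{2}$ in units of $\hbar\omega$, without any reference to the wavefunctions.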


5.4 Quantization of the Electromagnetic Fields 

Maxwell’s equations give a description of electricity and magnetism in terms of 
the electric and magnetic fields, but say very little about the quanta: electrons 
and photons, that constitute the fields. This field description treats electric and 
magnetic phenomena like continuum fluids. By quantization, we can treat 
electromagnetic phenomena from the perspective of the quanta. This is the basis 
of quantum photonics. 

A laser (light amplification by stimulated emission of radiation) is a device 
whose operation is based on the behavior of an electromagnetic field inside an 
optical resonator cavity. Laser emission is based on the presence of both 
spontaneous and stimulated emission. Stimulated emission does not occur on its 
own. It requires the presence of a photon. These photons are generated by 
spontaneous emission. A laser cannot function without the occurrence of 
spontaneous emission. 

The macroscopic description given by Maxwell’s equations is inadequate to 
describe spontaneous emission, whereas, spontaneous emission is a natural 
consequence of the quantization of the electromagnetic fields. We will show here 
that the quantum analysis of the electromagnetic field inside a laser cavity is 
formally equivalent to the treatment of the harmonic oscillator. This results 
directly from the fact that the electromagnetic energy oscillates between electric 
and magnetic fields in a way that is mathematically identical to the oscillation of 
energy between kinetic and potential components in a harmonic oscillator. 

The Hamiltonian operator for the total energy is:

$$\mathcal{H} = \frac{1}{2}\int \left(\mu\,\mathbf{H}\cdot\mathbf{H} + \varepsilon\,\mathbf{E}\cdot\mathbf{E}\right) dV \qquad (5.23)$$

We seek conjugate variables p and Q that will let us write the Hamiltonian
operator in the form:

$$\mathcal{H} = \sum_a \frac{1}{2}\left(p_a^2 + \omega_a^2 Q_a^2\right) \qquad (5.24)$$

with the desired commutator properties:

$$[p_a, Q_b] = -i\hbar\,\delta_{ab} \qquad (5.25)$$

and

$$[Q_a, Q_b] = [p_a, p_b] = 0 \qquad (5.26)$$

Of course $Q_a$ and $p_a$ no longer correspond to position and momentum. We
wish to describe elementary excitations of the electromagnetic field. These are
photons, and they have neither mass nor position as basic properties.

Without any loss of generality, we can suppose that the electric and magnetic
fields exist inside a volume of arbitrary dimensions. This volume forms a cavity.
To develop the formalism, the electric and magnetic fields can be expressed as
an expansion of the cavity modes $\mathbf{E}_a(\mathbf{r})$ and $\mathbf{H}_a(\mathbf{r})$, which are
orthogonal by design. We can propose

$$\mathbf{H}(\mathbf{r},t) = \sum_a \frac{1}{\sqrt{\mu}}\,\omega_a Q_a(t)\,\mathbf{H}_a(\mathbf{r}) \qquad (5.27)$$

and

$$\mathbf{E}(\mathbf{r},t) = -\sum_a \frac{1}{\sqrt{\varepsilon}}\,p_a(t)\,\mathbf{E}_a(\mathbf{r}) \qquad (5.28)$$

where, as usual, $\omega_a$ is the angular frequency of mode $a$.

Substituting (5.27) and (5.28) in (5.23), and using the orthogonality of cavity
modes, we obtain:

$$\mathcal{H} = \sum_a \frac{1}{2}\left(p_a^2(t) + \omega_a^2 Q_a^2(t)\right) \qquad (5.29)$$

It can be seen by comparison with (5.3) that (5.29) is expressed in the desired
form for a harmonic oscillator. By considering $p_a$ and $Q_a$ as canonically
conjugate variables, we can use all the results (5.10 through 5.22) to quantize
the electromagnetic fields, based on the individual quantum: that is, the photon
(Grynberg et al. 2010).

Following (5.11), the basic commutator is:

$$[p_a, Q_b] = -i\hbar\,\delta_{ab} \qquad (5.30)$$


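We define raising and lowering operators $a_l^{\dagger}$ and $a_l$ for each mode $l$. These
operators are time-dependent, because $p_l(t)$ and $Q_l(t)$ are time-dependent.

$$a_l^{\dagger}(t) = \frac{1}{\sqrt{2\hbar\omega_l}}\left[\omega_l Q_l(t) - i\,p_l(t)\right] \qquad (5.31)$$

and

$$a_l(t) = \frac{1}{\sqrt{2\hbar\omega_l}}\left[\omega_l Q_l(t) + i\,p_l(t)\right] \qquad (5.32)$$

As is the case with (5.14), the commutator $[a_l(t), a_l^{\dagger}(t)] = 1$.
Solving for p and Q,

$$p_l(t) = i\sqrt{\frac{\hbar\omega_l}{2}}\left[a_l^{\dagger}(t) - a_l(t)\right] \qquad (5.33)$$

and

$$Q_l(t) = \sqrt{\frac{\hbar}{2\omega_l}}\left[a_l^{\dagger}(t) + a_l(t)\right] \qquad (5.34)$$

Substitution of these expressions into (5.24) gives straight away:

$$\mathcal{H} = \sum_l \hbar\omega_l\left(a_l^{\dagger} a_l + \frac{1}{2}\right) \qquad (5.35)$$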
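
The substitution can be written out for a single mode. The sketch below assumes the standard forms $p = i\sqrt{\hbar\omega/2}\,(a^{\dagger} - a)$ and $Q = \sqrt{\hbar/2\omega}\,(a^{\dagger} + a)$ together with the commutator $[a, a^{\dagger}] = 1$; the mode index is dropped for brevity.

```latex
\begin{align*}
p^2 &= -\frac{\hbar\omega}{2}\left(a^{\dagger} - a\right)^2, \qquad
\omega^2 Q^2 = \frac{\hbar\omega}{2}\left(a^{\dagger} + a\right)^2 \\
\frac{1}{2}\left(p^2 + \omega^2 Q^2\right)
  &= \frac{\hbar\omega}{4}\left[\left(a^{\dagger} + a\right)^2 - \left(a^{\dagger} - a\right)^2\right] \\
  &= \frac{\hbar\omega}{2}\left(a^{\dagger} a + a a^{\dagger}\right) \\
  &= \hbar\omega\left(a^{\dagger} a + \frac{1}{2}\right)
\end{align*}
```

The last line uses $a a^{\dagger} = a^{\dagger} a + 1$. Summing over independent modes gives the multi-mode Hamiltonian.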

The eigenfunctions of this Hamiltonian are the stationary states of the
electromagnetic field, for which total energy is conserved as it oscillates between
electrical and magnetic fields. (The time dependence is of the form $e^{-i\omega t}$, and
can be treated separately.)

If $\psi_n$ is such a stationary state, with characteristic frequency $\omega = \omega_l$,
then:

$$a_l^{\dagger}\,\psi_n = \sqrt{n+1}\;\psi_{n+1} \qquad (5.36)$$

$$a_l\,\psi_n = \sqrt{n}\;\psi_{n-1} \qquad (5.37)$$

and

$$a_l^{\dagger} a_l\,\psi_n = n\,\psi_n \qquad (5.38)$$


These results give a complete description of the electromagnetic field from
the quantum, that is, from the photon point of view. A given stable state of the
field $\psi_n$ consists of n quanta, or photons. Each photon has an energy $\hbar\omega_l$.
The energy of the state is the sum of the energies of the photons, plus the
zero-point energy $\hbar\omega_l/2$. Photons are excitations of the electromagnetic field.
Excitations can be created or destroyed. On the other hand, there is no mechanism
for these excitations to interact with one another.

The operator $a_l^{\dagger}$ changes the stable state $\psi_n$ to the stable state $\psi_{n+1}$. This
action creates an additional photon of frequency $\omega_l$, because the state $\psi_{n+1}$ must
have n + 1 quanta. For this reason, we call $a_l^{\dagger}$ the creation operator. In a
similar fashion, the operator $a_l$ changes the stable state $\psi_n$ to the state $\psi_{n-1}$.
This removes a photon from the field, and $a_l$ is known as an annihilation operator.

The actions of creation or annihilation correspond experimentally to the
emission or absorption of a photon. Note from (5.36) that the application of the
creation operator on the vacuum state (n = 0) results in the creation of a photon
in the n = 1 state. On the other hand, the application of the annihilation operator
on the vacuum state returns zero, since there is no photon to remove.
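
The action of the two operators on number states can be sketched in a few lines. This is an illustration under the usual rules (the creation operator contributes a factor √(n+1), the annihilation operator a factor √n); the function names are hypothetical, and a state is represented only by its photon number.

```python
import math

def create(n):
    """Apply the creation operator to |n>: returns (amplitude, new n)."""
    return math.sqrt(n + 1), n + 1

def annihilate(n):
    """Apply the annihilation operator to |n>: returns (amplitude, new n).

    Acting on the vacuum gives amplitude 0 and no state at all:
    the result is the null vector, not a state containing zero photons.
    """
    if n == 0:
        return 0.0, None
    return math.sqrt(n), n - 1

def number_of_quanta(n):
    """Apply the annihilation and then the creation operator to |n>.

    The product of the two amplitudes is the eigenvalue of the number operator.
    """
    amp_down, lower = annihilate(n)
    if lower is None:
        return 0.0
    amp_up, _ = create(lower)
    return amp_down * amp_up

print(create(0))      # (1.0, 1): one photon created from the vacuum
print(annihilate(0))  # (0.0, None): nothing can be removed from the vacuum
```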

We can use these results to describe entangled photons, lasers and 
photodetectors. 


5.5 Spontaneous Parametric Down-Conversion 

Creation or annihilation of photons is the way that the electromagnetic field 
exchanges energy with matter. For example, a photon can be destroyed, 
promoting an electron from one energy state to another, provided that the energy 
difference between the initial and final states is equal to the energy of the 
photon. An important example for quantum photonics is spontaneous parametric
down-conversion (SPDC), which is a useful method for creating entangled
photons. Such photons are the workhorse for the examination of Bell’s Theorem,
which is the subject of Chap. 6.

Spontaneous parametric down-conversion is a special case of parametric
optical amplification, which occurs in non-linear optical materials. In Chap. 9 we
will study some important non-linear optical effects using mostly a classical
analysis. However, the case of spontaneous parametric down-conversion cannot
be described by Maxwell’s equations because it involves interactions with the
vacuum state, where, classically speaking, the electric field is zero. In this
section we show in a straightforward way, using the quantized treatment of
photons, some fundamental aspects of parametric down-conversion.

With the quantized representation of the electromagnetic field it is natural to
think of the photon as a particle with energy $E = \hbar\omega$ and momentum
$\mathbf{p} = \hbar\mathbf{k}$. We will now consider the situation where a photon of energy
$E_3 = \hbar\omega_3$ is annihilated, creating in its place two photons with energies
$E_1 = \hbar\omega_1$ and $E_2 = \hbar\omega_2$, where $\hbar\omega_3 = \hbar\omega_1 + \hbar\omega_2$, conserving energy, and
where $\hbar\mathbf{k}_3 = \hbar\mathbf{k}_1 + \hbar\mathbf{k}_2$, conserving momentum. These two photons must be
created within a wavelength of each other in order to conserve momentum. This
means that the two lower-energy photons are created as a pair, simultaneously
(that is, within a time interval $\Delta t < 1/\omega$) with the annihilation of the
higher-energy photon, in order to conserve energy. The Hamiltonian for this
interaction can be written down straight away as:

$$\mathcal{H} = \sum_l \hbar\omega_l\left(a_l^{\dagger} a_l + \frac{1}{2}\right) + \hbar\gamma\left(a_1^{\dagger} a_2^{\dagger} a_3 + a_1 a_2 a_3^{\dagger}\right) = \mathcal{H}_0 + \mathcal{H}_1 \qquad (5.39)$$

The additional term $\mathcal{H}_1$ contains a coupling constant $\gamma$ and corresponds to the
annihilation of a photon in mode 3 and the creation of two photons from the
vacuum in modes 1 and 2, where no photons were present initially. It contains a
second term corresponding to the annihilation of two photons in the vacuum
state, but this returns 0. Since the interaction is simultaneous, the order of the
events is not important.

The initial state is written:

$$\psi(t = 0) = |0, 0, N_3\rangle \qquad (5.40)$$

The effect of the interaction Hamiltonian acting on this state is:

$$\mathcal{H}_1\,|\psi(t = 0)\rangle = \hbar\gamma\sqrt{N_3}\;|1, 1, N_3 - 1\rangle + 0 \qquad (5.41)$$

The intensity of the down-converted beam is proportional to the square
modulus of this state,

$$I \propto \left|\mathcal{H}_1\,\psi(t = 0)\right|^2 \propto N_3 \qquad (5.42)$$

and is linearly proportional to the intensity of the incident beam. As a result,
the conversion efficiency is not dependent on pump beam intensity (Fig. 5.1).
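
The linear dependence on the pump photon number can be checked by applying the interaction term to a number state directly. The sketch below is an illustration, not the author's calculation: it assumes the three-mode interaction $\gamma(a_1^{\dagger} a_2^{\dagger} a_3 + a_1 a_2 a_3^{\dagger})$ in units of ħ, represents a state as a tuple of photon numbers, and uses hypothetical function names.

```python
import math

def apply_interaction(state, gamma=1.0):
    """Apply gamma*(a1† a2† a3 + a1 a2 a3†) to the number state |n1, n2, n3>.

    Returns a dict {(n1, n2, n3): amplitude}. Terms with zero amplitude
    (for example, annihilating a photon from an empty mode) are dropped.
    """
    n1, n2, n3 = state
    out = {}
    # a1† a2† a3: remove one pump photon, add one photon to modes 1 and 2
    down = gamma * math.sqrt(n1 + 1) * math.sqrt(n2 + 1) * math.sqrt(n3)
    if down != 0.0:
        out[(n1 + 1, n2 + 1, n3 - 1)] = down
    # a1 a2 a3†: the reverse process, which vanishes when mode 1 or 2 is empty
    up = gamma * math.sqrt(n1) * math.sqrt(n2) * math.sqrt(n3 + 1)
    if up != 0.0:
        out[(n1 - 1, n2 - 1, n3 + 1)] = up
    return out

def pair_rate(n_pump, gamma=1.0):
    """Squared norm of the interaction acting on |0, 0, n_pump>."""
    result = apply_interaction((0, 0, n_pump), gamma)
    return sum(amp ** 2 for amp in result.values())

print(apply_interaction((0, 0, 4)))  # {(1, 1, 3): 2.0}
```

Doubling the pump photon number doubles `pair_rate`, so the ratio of pairs produced to pump photons supplied stays constant.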



Fig. 5.1 The detection rate of down-converted photons is linearly proportional to the average pump power
(plotted as a percentage of its maximum value), in agreement with (5.42). a, b correspond to two different
sample thicknesses. In each case, the three curves correspond to different conditions of pump beam focus
(reproduced from O. Süzer and T.G. Goodson III, Optics Express, 16, pp. 20166-20175 (2008) by
permission of the Optical Society of America, 2016)

In practice, the down-conversion effect is very weak, with an efficiency of
about 1 part in $10^{12}$. That is, $\gamma$ (the coupling constant) is small. This is in part
due to the 3-particle interaction, which requires all three particles to be in the
same place (within a wavelength) at the same time (within a reciprocal
frequency). Thus it can be treated as a perturbation. Such a low rate assures that
any down-converted photons exit the conversion crystal before the generation of
a subsequent pair. For measurements on distinguishable photon pairs, such a low
efficiency results in an arrival rate of several thousand per second. This is an
advantage in experiment because it allows substantial “dead time” between the
arrival of photon pairs at the detectors, and this allows the clean identification of
correlated photon pairs.
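
A rough estimate shows how an efficiency of 1 part in $10^{12}$ translates into a pair rate. The numbers below are assumptions chosen for illustration only: a hypothetical pump power of 1 mW at a vacuum wavelength of 325 nm. The function names are likewise hypothetical.

```python
H_PLANCK = 6.62607015e-34   # Planck constant, J s
C_LIGHT = 2.99792458e8      # speed of light in vacuum, m/s

def photon_rate(power_w, wavelength_m):
    """Number of photons per second in a beam of given power and wavelength."""
    photon_energy_j = H_PLANCK * C_LIGHT / wavelength_m
    return power_w / photon_energy_j

def spdc_pair_rate(power_w, wavelength_m, efficiency=1e-12):
    """Pairs per second, assuming a fixed conversion efficiency per pump photon."""
    return photon_rate(power_w, wavelength_m) * efficiency

# Assumed pump: 1 mW at 325 nm
rate = spdc_pair_rate(power_w=1e-3, wavelength_m=325e-9)
print(f"{rate:.0f} pairs per second")
```

With these assumed values the pump delivers about $1.6 \times 10^{15}$ photons per second, so the pair rate is of order a thousand per second, and the mean time between pairs is far longer than the transit time through the crystal.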

Birefringent crystalline materials provide a medium in which both energy
conservation and momentum conservation can be achieved. Beta barium borate
($\beta$-BaB$_2$O$_4$), or BBO, is a birefringent non-linear crystal widely used for this
application. Its wavelength range of transparency is $300 < \lambda < 2000$ nm. Silicon
avalanche photodiodes are a good choice to detect single photons. Their
sensitivity is optimal at wavelengths shorter than 900 nm. For example, a He-Cd
laser emits at $\lambda = 325$ nm in vacuum ($E = 3.82$ eV). It can generate two photons
each having half the energy ($E = 1.91$ eV, $\lambda = 650$ nm in vacuum). Using type-I
phase-matching, the polarization of the two photons is the same. When type-II
phase-matching is used, one photon will be polarized at 90° with respect to the other.
This is the desired situation for generating an entangled photon pair.
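
The energies and wavelengths quoted above follow from $E = hc/\lambda$. A minimal sketch, using $hc \approx 1239.84$ eV nm and hypothetical function names:

```python
HC_EV_NM = 1239.841984  # h*c expressed in eV nm

def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a vacuum wavelength given in nm."""
    return HC_EV_NM / wavelength_nm

def degenerate_spdc(pump_wavelength_nm):
    """Energy (eV) and vacuum wavelength (nm) of each photon of a degenerate pair."""
    daughter_energy = photon_energy_ev(pump_wavelength_nm) / 2.0
    daughter_wavelength = HC_EV_NM / daughter_energy
    return daughter_energy, daughter_wavelength

pump_ev = photon_energy_ev(325.0)
pair_ev, pair_nm = degenerate_spdc(325.0)
print(f"pump: {pump_ev:.2f} eV, pair: {pair_ev:.2f} eV at {pair_nm:.0f} nm")
```

Halving the photon energy doubles the vacuum wavelength, which places the down-converted photons at 650 nm, inside the sensitive range of silicon avalanche photodiodes.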

Momentum conservation is achieved by considering the wavevectors of the
three photons. The total momentum, i.e. the wavevector of the incident beam, is
determined by the experimental choice of its propagation direction in the
birefringent crystal relative to its optical axis. With the energy of the incident
photon invariant and the wavevector determined by the propagation direction, a
number of solutions for momentum conservation are possible.

The down-converted photons span a range of allowed photon energies and
emission angles. When type-I phase matching is used, the two down-converted
photons are emitted on the surface of a cone. A presentation of this effect by
Alan Migdall can be viewed on YouTube (Migdall 2010). An image of this
fluorescence is shown in Fig. 5.2 (Ling 2008).



Fig. 5.2 Photograph of parametric down-conversion in BBO, using type-I phase-matching. The laser
pump beam (energy = 3.82 eV) interacts with BBO through its second-order nonlinear optical coefficient to
annihilate a pump photon and create a pair of photons from the vacuum state. Many possibilities exist for 
the wavelength and emission directions of the down-converted photons provided that total energy and 
momentum are conserved. (Photograph courtesy of Prof. Alexander Ling, Center for Quantum 
Technologies, National University of Singapore, reproduced by the kind permission of the author) 

To illustrate, we suppose the case of degenerate down-conversion, where the
energies of the two down-converted photons are the same.

Conservation of energy: $\omega_3 = 2\omega_1 = 2\omega_2$.

Conservation of momentum: $\mathbf{k}_3 = \mathbf{k}_1 + \mathbf{k}_2$, where

$$|\mathbf{k}_3| = \frac{n_3\,\omega_3}{c} \qquad (5.43)$$

and $n_3$ is the index of refraction of the pump beam at the higher frequency and
in the direction of propagation in the crystal.

One photon of the down-converted pair is emitted along the extraordinary
axis:

$|\mathbf{k}_1| = n_e\,\omega_1/c$, where $n_e$ is the extraordinary index of refraction at $\omega_1 = \omega_3/2$.

It is polarized parallel to the plane defined by the optic axis of the crystal and
the pump beam.

The other photon of the pair is emitted along the ordinary axis:

$|\mathbf{k}_2| = n_o\,\omega_2/c$, where $n_o$ is the ordinary index of refraction at $\omega_2 = \omega_3/2$.

This photon is polarized perpendicular to the plane defined by the optic axis
and the pump beam.

In the case of type-II conservation of momentum, $|\mathbf{k}_1| + |\mathbf{k}_2| > |\mathbf{k}_3|$. In this
case, both $\mathbf{k}_1$ and $\mathbf{k}_2$ must be oriented at angles with respect to $\mathbf{k}_3$. This means
that the down-converted photons are emitted in different directions, away from
each other and from the pump beam. This is a convenient way to separate the
photons (Kwiat et al. 1995). Momentum conservation is achieved when:

$$|\mathbf{k}_3| = |\mathbf{k}_1|\cos(\alpha_1) + |\mathbf{k}_2|\cos(\alpha_2) \qquad (5.44)$$

The photon pairs that satisfy both energy and momentum conservation are
emitted in directions that lie along the surface of a cone, one for each photon of
the down-converted pair. The angle of each cone ($\alpha_1$ or $\alpha_2$) depends on the
wavevectors of all three photons.
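
The emission angles can be found from the three wavevector magnitudes. The sketch below is an illustration under stated assumptions: it treats the three wavevectors as the sides of a closed triangle, so that the component along the pump satisfies (5.44) and the transverse components cancel, and it solves for the angles with the law of cosines. The refractive indices are hypothetical values chosen only so that $|\mathbf{k}_1| + |\mathbf{k}_2| > |\mathbf{k}_3|$; they are not measured BBO data.

```python
import math

def wavevector(index, wavelength_nm):
    """Magnitude |k| = 2*pi*n/lambda in rad/nm, for vacuum wavelength lambda."""
    return 2.0 * math.pi * index / wavelength_nm

def emission_angles(k1, k2, k3):
    """Angles (radians) of k1 and k2 relative to k3 when k3 = k1 + k2 as vectors.

    Uses the law of cosines on the triangle formed by the three wavevectors.
    Raises ValueError when the magnitudes cannot close a triangle.
    """
    if k1 + k2 < k3 or abs(k1 - k2) > k3:
        raise ValueError("momentum conservation cannot be satisfied")
    cos_a1 = (k3**2 + k1**2 - k2**2) / (2.0 * k3 * k1)
    cos_a2 = (k3**2 + k2**2 - k1**2) / (2.0 * k3 * k2)
    # Clamp: rounding can push a cosine marginally outside [-1, 1]
    cos_a1 = max(-1.0, min(1.0, cos_a1))
    cos_a2 = max(-1.0, min(1.0, cos_a2))
    return math.acos(cos_a1), math.acos(cos_a2)

# Hypothetical indices for a 325 nm pump and a degenerate pair at 650 nm
k3 = wavevector(1.660, 325.0)
k1 = wavevector(1.665, 650.0)
k2 = wavevector(1.670, 650.0)
alpha1, alpha2 = emission_angles(k1, k2, k3)
print(f"alpha1 = {math.degrees(alpha1):.2f} deg, alpha2 = {math.degrees(alpha2):.2f} deg")
```

With these assumed indices the angles come out at a few degrees, and a small change in any index changes the angles noticeably, which is why the pump direction relative to the optic axis is a useful tuning parameter.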

When the pump laser beam at $\lambda = 325$ nm (in vacuum) passes through a BBO
crystal, some of the photons undergo spontaneous down-conversion with
trajectories that are constrained by conservation of momentum to lie along the
surface of two cones, as shown in Figs. 5.3 and 5.4. The polarization of the
photon on one cone is correlated and perpendicular to the polarization of the
photon on the second cone. Because of conservation of momentum, each photon
of a pair travels on a trajectory that is symmetric to that of the second photon
with respect to the axis defined by the trajectory of the pump beam.




[Fig. 5.3 labels: optic axes; extraordinary; ordinary; surface of the BBO crystal; axes r, z]

Fig. 5.3 A schematic diagram of spontaneous parametric down-conversion in a non-linear crystal like BBO,
using type-II phase-matching. Down-converted photons are emitted along the surface of a cone only when
momentum and energy are simultaneously conserved. (Fig. 5.3b copyright J-Wiki at English Wikipedia,
2011, https://commons.wikimedia.org/wiki/File:SPDC figure.png)
Fig. 5.4 A diagram of the cross-section of the cones along which the down-converted photons propagate.
(a) A photon can be emitted along any path along the cone surface, but the position of its pair is correlated so
that momentum is conserved. The polarizations of the photons are mutually orthogonal. (b) When the angle
between the pump beam and the optic axis increases, the beam experiences a smaller index of refraction in
BBO, and its wavevector increases. To achieve momentum conservation, the angles $\alpha_1$ and $\alpha_2$ must
decrease. This causes the two emission cones to intersect, creating pairs of polarization-entangled photons














Changing the angle of the pump beam relative to the optical axis also 
changes its wavevector. While the energy conservation condition remains 
unchanged, this modifies the condition for momentum conservation, resulting in 
a change in the angle between the two emission cones . An example is 
diagrammed in Fig. 5.4b where the two emission cones intersect. A video of this 
experiment has been prepared by the group of Prof. Alexander Ling at the Center 
for Quantum Technologies at the National University of Singapore. You can 
view this on-line at: http://qolah.org/research/hqpdc/rings.avi. 

Patrick Bronner of the University of Erlangen has created a sequence of on-line
experiments, “Quantum Lab”, on quantum photon behavior. This presentation
includes a video of spontaneous parametric down-conversion as well as
experiments using entangled photons prepared by this method. These can be
viewed under “Entanglement” at:
http://www.didaktik.physik.uni-erlangen.de/quantumlab/english/index.html

The trajectories of the photon pairs may exist simultaneously along the two
lines where the surfaces of the cones intersect. This effect can be captured, and a
photographic image created by down-converted photons is shown in Fig. 5.5 
(Reck 1996). For photons propagating along this line, the polarization is 
indeterminate, although the net polarization of the pair remains zero. This results 
in entanglement of the photon pairs (Catalano 2014). Because of this simple 
symmetrical geometric arrangement, it is straightforward to harvest these 
entangled pairs. 



Fig. 5.5 Photographic image of down-converted photon pairs in BBO. The two emission cones have been 
made to intersect in the experiment creating polarization entanglement. From experiments carried out by 
P.G. Kwiat and M. Reck. (Photograph courtesy of Prof. Paul Kwiat, Department of Physics, University of 
Illinois, Urbana-Champaign, reproduced by the kind permission of the authors) 


5.6 Two-Photon Interference 

Interference of optical beams has been studied at least since the observations of
Francesco-Maria Grimaldi in the 17th century. Single photons also show
interference. The experiments of Dykstra et al. and Grangier et al., discussed in
Chap. 2, show that the wavefunction of a single photon interferes with itself, and
produces an interference pattern that is characteristic of the structures
encountered during propagation. Self-interference is not restricted to photons; it
is a general property of single quanta. The electron interference pattern obtained
by Tonomura (Fig. 2.11) is an example of self-interference of single electrons.
Experiments such as these demonstrate the extended and non-localised nature of
the wavefunction of quantum particle-waves.

On this topic, Dirac has stated on page 9 of his text on quantum mechanics
(see reference Dirac 1930): “Each photon interferes with itself. Interference
between two different photons never occurs”.

This is perhaps an overstatement of the situation. To be sure, two photons do 
not interact with each other, but interference can occur if the two photons are 
indistinguishable. Indistinguishable photons have the same defining quantum 
states of frequency and polarization. For interference to occur, the wave 
functions of two indistinguishable photons must also overlap in space and time. 
On the other hand, such interference between two different electrons, and in 
general between fermions is forbidden by the Pauli exclusion principle. 

We could restate Dirac’s claim as follows: “Interference between two 
distinguishable photons does not occur”. 

The interference of a photon with itself, or the interference of an electron 
with itself, is the degree zero of indistinguishability. Although the wavefunction 
may be reshaped by passage through an object, such as a tunneling barrier or a 
beam splitter, or a pair of slits, the quantum properties of each part of the 
wavefunction of a single quantum are identical, and when these components can 
be made to recombine at the same place at the same time, quantum interference 
is inevitable (Ou 2007). 

In this section, we will present some important experiments that show 
quantum interference between two photons, demonstrating what is meant by 
“indistinguishable”, and “in the same place at the same time”. 

A photon has two possibilities when it encounters a beam splitter: reflection
or transmission (ignoring absorption or inelastic scattering). When two photons
are incident on a beam splitter, one on each side, there are four possible
outcomes. These are indicated schematically in Fig. 5.6. The two photons are
colored differently, making them, as well as the outcomes, distinguishable. Note
that the “blue” photon experiences a $\pi$ phase shift on reflection, relative to the
behavior of the “red” photon, following the presentation of the beam splitter in
Chap. 2. These photons could be distinguishable for several reasons: different
frequencies, different polarizations or different times of passage through the
beam splitter.



Fig. 5.6 Two photons can intersect a beam splitter in a number of ways. If the photons are distinguishable
(one in red, the other in blue) there are four distinguishable possibilities. There is a relative phase shift of
$\pi$ radians between photons reflected from one surface relative to those reflected from the other surface,
represented here by the change in sign of the events 3 and 4 compared to events 1 and 2

The two photons are indistinguishable if they have the same frequency and 
polarization and they arrive at the beam splitter at the same time. This situation 
is diagrammed in Fig. 5.7. 



Fig. 5.7 When the two photons are indistinguishable, then the number of possibilities is reduced to 3 
events. However, events 2 and 3 are indistinguishable, but have different signs. The wavefunctions 
corresponding to these two events interfere and cancel each other 


There are 4 cases as before, but only three of these cases are distinguishable.
Cases 2 and 3 cannot be distinguished. In a 50%-50% beam splitter, the
amplitudes of the wavefunctions representing these two cases are equal. Because
they are out of phase by $\pi$ radians, they will cancel each other. Only states 1 and
4 remain. This is an example of two-photon interference.

Using the operators of the quantized electromagnetic field developed above,
we can develop the quantum mechanical description of this two-photon
interference.

In Fig. 5.8 we show a schematic diagram of a beam splitter with input ports a 
and b and output ports c and d. 


Fig. 5.8 A simplified diagram of a beam splitter with input ports a and b and two output ports c and d 


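Using the creation operators $a^{\dagger}$ and $b^{\dagger}$, one photon is created at each input
simultaneously:

$$a^{\dagger} b^{\dagger}\,|0,0\rangle_{ab} = |1,1\rangle_{ab} \qquad (5.45)$$

In the experiment to be described below, these photons are created by
spontaneous parametric down-conversion with the same frequency and
polarization. Subsequently, they propagate to the input ports of the beam splitter.
The beam splitter mixes the modes at the input ports and turns them into new
modes c and d. The creation operators $a^{\dagger}$ and $b^{\dagger}$ are also transformed by the
beam splitter. Assuming a 50%-50% beam splitter,

$$a^{\dagger} \rightarrow \frac{1}{\sqrt{2}}\left(c^{\dagger} - d^{\dagger}\right) \qquad (5.46)$$

$$b^{\dagger} \rightarrow \frac{1}{\sqrt{2}}\left(c^{\dagger} + d^{\dagger}\right) \qquad (5.47)$$

where the minus sign in (5.46) reflects the relative $\pi$ phase-shift between the
two reflected modes exiting the beam splitter. This ensures that the beam splitter
conserves energy; that is, that the transformation is unitary. Arranging in matrix
form:

$$\begin{pmatrix} a^{\dagger} \\ b^{\dagger} \end{pmatrix} \rightarrow \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & -1 \\ 1 & 1 \end{pmatrix} \begin{pmatrix} c^{\dagger} \\ d^{\dagger} \end{pmatrix} \qquad (5.48)$$

Developing (5.45):

$$a^{\dagger} b^{\dagger}\,|0,0\rangle_{ab} \rightarrow \frac{1}{2}\left(c^{\dagger} - d^{\dagger}\right)\left(c^{\dagger} + d^{\dagger}\right)|0,0\rangle_{cd} = \frac{1}{2}\left(c^{\dagger} c^{\dagger} - d^{\dagger} d^{\dagger} + c^{\dagger} d^{\dagger} - d^{\dagger} c^{\dagger}\right)|0,0\rangle_{cd} \qquad (5.49)$$

We recognize the last two terms as the commutator of the two creation operators,
$[c^{\dagger}, d^{\dagger}] = c^{\dagger} d^{\dagger} - d^{\dagger} c^{\dagger}$, which vanishes by orthogonality (5.30).

$$|1,1\rangle_{ab} \rightarrow \frac{1}{2}\left(\sqrt{2}\,|2,0\rangle_{cd} - \sqrt{2}\,|0,2\rangle_{cd}\right) \qquad (5.50)$$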

Thus, two identical photons entering a 50%-50% beam splitter will appear 
only as photon pairs at one or the other of the exit ports. As a consequence the 
photons will not appear coincidentally, one at each of the exit ports. The result 
that coincidence vanishes when the two identical photons enter the beam splitter 
is an example of interference of photon wave functions at the quantum level. 
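
The cancellation of the coincidence term can be reproduced with a few lines of algebra on the creation operators. The sketch below is an illustration under stated assumptions: a lossless beam splitter with real amplitude transmission t and reflection r, acting as $a^{\dagger} \rightarrow t\,c^{\dagger} - r\,d^{\dagger}$ and $b^{\dagger} \rightarrow r\,c^{\dagger} + t\,d^{\dagger}$. This sign convention is one of several equivalent choices, and the function names are hypothetical. Because creation operators of different modes commute, their products can be handled as ordinary polynomials.

```python
import math
from collections import defaultdict

def multiply(p, q):
    """Product of two polynomials in the commuting operators c† and d†.

    Each polynomial is a dict {(power of c†, power of d†): coefficient}.
    """
    out = defaultdict(float)
    for (i1, j1), x in p.items():
        for (i2, j2), y in q.items():
            out[(i1 + i2, j1 + j2)] += x * y
    return dict(out)

def output_probabilities(t=1 / math.sqrt(2)):
    """Probabilities of the output number states |i, j> for the input |1, 1>.

    Assumes real amplitudes with r = sqrt(1 - t^2); t = 1/sqrt(2) is the
    50%-50% beam splitter.
    """
    r = math.sqrt(1.0 - t * t)
    a_dag = {(1, 0): t, (0, 1): -r}
    b_dag = {(1, 0): r, (0, 1): t}
    product = multiply(a_dag, b_dag)
    # (c†)^i (d†)^j |0,0> = sqrt(i! j!) |i,j>
    return {
        (i, j): (coeff * math.sqrt(math.factorial(i) * math.factorial(j))) ** 2
        for (i, j), coeff in product.items()
    }

for state, prob in sorted(output_probabilities().items()):
    print(state, round(prob, 12))
```

For the 50%-50% case the coincidence state (1, 1) has vanishing probability and the two bunched states share the probability equally. For an unbalanced splitter the (1, 1) coefficient is t² - r², so the cancellation is only partial.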

The measurement that demonstrates this result is the Hong-Ou-Mandel 
experiment. This effect is a cornerstone for studies and applications of photon 
correlation and interference, including quantum communication and quantum 
computing. Leonard Mandel was a distinguished scientist and teacher whose 
specialty was quantum optics. The Hong-Ou-Mandel experiment, published in 
1987 (see references), marks a major achievement in his lifelong studies of two- 
photon interference. 

In Fig. 5.9, we show a schematic diagram of a Hong-Ou-Mandel 
measurement. 



























Fig. 5.9 A schematic diagram of the experimental set-up used by Hong, Ou and Mandel for measurement
of 2-photon interference (Hong et al. 1987). The photon pairs are generated by spontaneous parametric
down-conversion. Type-I phase-matching applies in KDP, so the photons are emitted along the surface of a
single cone centered on the pump beam axis. The polarizations of the two photons are the same. The
relative arrival times of the two photons can be varied by displacement of the beam splitter

The measurement proceeds as follows. A UV photon beam (argon laser,
$\lambda = 351.1$ nm) enters a non-linear optical crystal of potassium dihydrogen
phosphate, or KDP. KDP can be used to produce photon pairs by spontaneous
parametric down-conversion. The crystal is cut and aligned to produce two
photons of the same energy (degenerate SPDC). Most of the UV beam passes
straight through and is absorbed in the beam stop as shown. A small fraction
of the beam (of order 1 part in $10^{12}$, as noted in Sect. 5.5) is transformed by
SPDC into photon pairs, each photon having one-half the frequency of the
photons in the pump beam. These photons have the
same frequency and the same polarization and are emitted along the surface of a
cone determined by the type-I phase-matching conditions for KDP. Note that the
photons have different momenta and propagate in different directions. This does
not affect indistinguishability. Mirrors are used to cause the photons to intersect
the beam splitter, as shown in Fig. 5.9. From (5.49) we know that two photons
will experience quantum interference if they are indistinguishable. By design,
the two photons in this experiment have the same frequency and the same
polarization. The question is: do they intersect the beam splitter at the same
place at the same time? In this experiment the beam splitter is mounted on a
precise displacement stage and scanned over a range of distances as indicated
schematically in Fig. 5.9. The scanner changes the relative distances the photons
have to travel in order to reach the beam splitter. Within this range is the point in
space where the two photons reach the beam splitter and overlap at the same
time.

Photon arrival at the two exit paths is recorded by the detectors. A
coincidence detector determines whether photons are seen at both detectors at
the same time (within a time interval much shorter than the time between
detection events; Hong, Ou and Mandel chose this time interval to be 7.5 ns).
Since the creation rate of SPDC photons is in general quite low (several thousand
per second), it is relatively easy to separate the detection events in time. The
measurement of coincidence events published by Hong, Ou and Mandel is
shown in Fig. 5.10 and the message is unambiguous. At the left of the figure, the
photons do not arrive at the beam splitter at the same time, so they are
distinguishable, and no interference occurs. The maximum coincidence rate is
about 10 events per second. On the right-hand side, the same situation obtains.
However, in the center of the figure, the photons arrive at the beam splitter at the
same point in space at the same time, and quantum interference reduces the
coincidence rate to near zero.



Fig. 5.10 Result of the Hong-Ou-Mandel measurement of 2-photon interference. On the left (beam
splitter < 280 μm) and on the right (beam splitter > 330 μm) the two photons pass through the beam splitter
at separate times. There is no interference, and detection of a photon in each detector within the same time
interval can be recorded as a coincidence. When the beam splitter is positioned in between, the photon
wavefunctions overlap in space and time, causing the probability for coincident events to vanish. This
figure courtesy of Z-Y Ou [reproduced by permission of the American Physical Society from Hong et al.
(1987)]

Two photons must have the same polarization in order to interfere. This 
requirement has been quantified by Kwiat, Steinberg and Chiao in a subsequent 
publication (Kwiat et al. 1992). They demonstrated how the relative polarization 
of two photons can be used to determine whether or not two-photon interference 
takes place. Experiments where polarization rotation is imposed are different 
from those in which photons are detected. Detection destroys the photon, 
whereas polarization rotation does not. 

The half-wave plate shown in Fig. 5.11 is used to rotate the polarization of
one of the photons entering the beam splitter. A rotation of the half-wave plate by
$\theta = 45°$ rotates the polarization of the photon by 90°. First of all, the
trombone delay line is adjusted so that the two photons arrive at the beam splitter
at the same time. When the polarizations of both photons are the same, 2-photon
interference can occur. When the polarizations are oriented at 90° to each other,
the photons are distinguishable and there should be no interference. Thus the
Hong-Ou-Mandel measurement should show a dip in the coincidence count
when the polarizations are aligned and absence of a dip when the polarizations
are orthogonal. This result is demonstrated in Fig. 5.12a.
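
The dependence of the dip on the wave plate setting can be sketched with a simple model. The assumptions are stated plainly: an ideal 50%-50% beam splitter, perfect temporal overlap, and two photons in pure linear polarization states, so that only the fraction $\cos^2(2\theta)$ of the two-photon amplitude that shares a common polarization can cancel. This is an idealized illustration with hypothetical function names, not a fit to the measured data.

```python
import math

def coincidence_probability(waveplate_angle_deg):
    """Coincidence probability at zero delay for an ideal 50%-50% beam splitter.

    A half-wave plate rotated by theta rotates linear polarization by 2*theta.
    The polarization overlap is cos^2(2*theta), giving P = (1 - overlap) / 2.
    """
    phi = math.radians(2.0 * waveplate_angle_deg)
    overlap = math.cos(phi) ** 2
    return 0.5 * (1.0 - overlap)

def dip_visibility(waveplate_angle_deg):
    """Depth of the dip relative to the distinguishable-photon level of 1/2."""
    return 1.0 - coincidence_probability(waveplate_angle_deg) / 0.5

for angle in (0.0, 22.5, 45.0):
    print(angle, round(coincidence_probability(angle), 6), round(dip_visibility(angle), 6))
```

In this model the dip is complete when the polarizations are parallel (plate at 0°) and disappears when they are orthogonal (plate at 45°), with intermediate settings giving a partial dip.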



Fig. 5.11 The measurement set-up used by Kwiat, Steinberg and Chiao is similar to the Hong, Ou and
Mandel experiment (Kwiat et al. 1992). The superposition of the single photons on the beam splitter is
achieved by a “trombone-prism” variable delay line instead of moving the beam splitter directly. A
half-wave plate is added in the path of one photon in order to rotate its polarization relative to that of the
other photon
Fig. 5.12 The importance of relative photon polarization in two-photon interference. (a) A Hong-Ou-Mandel
measurement with different relative polarizations of the two photons on entering the beam splitter.
Complete two-photon interference is achieved when the photons have the same polarization and cross the
beam splitter at the same time and place. (b) Measurement of the Hong-Ou-Mandel dip. The half-wave plate
in one arm is rotated, changing the polarization orientation of one photon. When the visibility is zero,
coincidence counts remain at the same values recorded when the photons do not cross the beam splitter at
the same time. When the visibility approaches unity, photon correlation reduces the coincidence counts to a
minimum level (reproduced by permission of the American Physical Society from Kwiat et al. (1992))

Kwiat et al. (1992) have demonstrated that 2-photon interference can be
restored to a pair of distinguishable photons if their polarizations can be made
identical, thus making the pair indistinguishable. This manipulation is known as
the quantum eraser effect (Hillmer and Kwiat 2007). A quantum eraser can be
created by placing a polarizer set at 45° relative to both the vertical and
horizontal in each photon path directly in front of each detector (and far away
from the beam splitter). The effect of these polarizers is to erase the information
about the polarization labeling and thus the origin of either photon prior to
passage through the polarizers, by putting each photon into a superposed state of
horizontal and vertical polarizations. Since the photon wavefunctions are now
made indistinguishable, two-photon interference will be observed by the absence
of coincident detection events. The quantum eraser is an important tool for
editing quantum states in quantum communication applications.


5.7 Summary 

In this chapter we have investigated the description of the electromagnetic
fields in terms of the photons that constitute them. The Hamiltonian operator for
the total energy of the electromagnetic fields consists of two terms representing
the energy stored in the electric and magnetic fields respectively. Propagation of
the fields is accompanied by an oscillation of the energy between the electric and
magnetic fields with a frequency $\omega$. The quantum of this oscillation is the
photon. The description of the electromagnetic fields in terms of the quanta is
formally equivalent to that of the quantum mechanical harmonic oscillator. To
quote Dirac (5.1) who detailed this analysis in 1930, as the Schrödinger wave
equation was also being developed:

The dynamical system consisting of an assembly of similar bosons (e.g. 
photons) is equivalent to the dynamical system of a set of oscillators - the 
two systems are just the same system looked at from two different points of 
view.... This is one of the most fundamental results of quantum mechanics, 
which enables a unification of the wave and corpuscular theories of light to 
be effected. 

The procedure of second quantization, which is the introduction of creation 
and annihilation operators, enables the analysis of systems of bosons to extract 
the energies and occupation numbers of the stable states of such systems without 
having to treat explicitly the eigenvectors of the system. In addition to the 
fundamental unification of the wave and particle viewpoints, this procedure also 
contributes a considerable simplification to the analysis of quantized energy 
levels. 

Thus, an electromagnetic oscillation having a frequency $\omega$ can be described 
at the quantum level as having an energy $E = \hbar\omega\,(n + \tfrac{1}{2})$, where n is the number of 
photons comprising the oscillation. The total energy of a wave consisting of 
multiple frequencies is just the arithmetic sum of the contribution of each 
frequency present. Thus treatment of the electro-magnetic field from the 
quantum point of view is formally equivalent to that of a set of independent 
harmonic oscillators. The relevant quantum variables are the occupation number 
n of each frequency mode and the energy of that mode. 
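
As a small numerical illustration of this bookkeeping, the sketch below sums the energies of independent modes, each treated as a harmonic oscillator. The function names and the example frequencies are hypothetical, chosen only for illustration; the value of the reduced Planck constant is the CODATA figure.

```python
# Reduced Planck constant in J*s (CODATA value).
HBAR = 1.054571817e-34


def mode_energy(omega, n):
    """Energy of one field mode: E = hbar * omega * (n + 1/2)."""
    if n < 0:
        raise ValueError("occupation number must be non-negative")
    return HBAR * omega * (n + 0.5)


def total_energy(modes):
    """Sum the energies of independent modes given as {omega: n}."""
    return sum(mode_energy(omega, n) for omega, n in modes.items())


# Hypothetical example: two modes with illustrative angular frequencies (rad/s).
modes = {1.2e15: 3, 2.4e15: 0}
print(total_energy(modes))
```

Note that the mode with n = 0 still contributes its zero-point energy to the total.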

The creation and annihilation operators can be used to describe the emission 
and absorption of photons in a direct and transparent way, without having to 
derive the quantum wavefunction in explicit form. This is a key practical result 
of this chapter. We can use the results directly to analyze stimulated and 
spontaneous emission (and absorption) of photons, and thus the performance of 
lasers, LEDs and photodiodes. 

The method of quantization of the electromagnetic field is complementary to 
the analysis of electromagnetic behavior using Maxwell’s field equations. There 
is a range of situations where both methods can be used in an equivalent manner. 
However, there are limits. When photons or electrons interact with vacuum 
fluctuations, then the quantized electromagnetic method must be used. This is 
the case for the description of laser or LED emission, and as we showed in the 
previous section, for spontaneous parametric down-conversion. In these cases, 
analysis using Maxwell’s equations fails to account for experimental observation. 
On the other hand, the analysis of interactions of photon flux or electron current 
with the macroscopic world is much easier using Maxwell’s equations, which 
describe the coupling and propagation of the electric and magnetic fields 
belonging to these particles. 


Exercises 

5.1 Use the Heisenberg uncertainty principle to show that the minimum energy 
of the quantum harmonic oscillator is $E_{\min} = \hbar\omega/2$. 

5.2 The wave function for the n = 1 state of the quantum harmonic oscillator 
is: 

$\psi_1(x) = \left(\frac{\alpha}{\sqrt{\pi}\,2^n n!}\right)^{1/2} (2\alpha x)\, e^{-\alpha^2 x^2/2}$, where n = 1 

And the wave function for the n = 2 state of the quantum harmonic 
oscillator is: 

$\psi_2(x) = \left(\frac{\alpha}{\sqrt{\pi}\,2^n n!}\right)^{1/2} (4\alpha^2 x^2 - 2)\, e^{-\alpha^2 x^2/2}$, where n = 2 

Show by algebraic derivation that $a^{\dagger}|\psi_1\rangle = \sqrt{2}\,|\psi_2\rangle$ 


5.3 Prove that the commutator $[a, a^{\dagger}] = 1$ 

5.4 Show that WD = where is the wavefunction describing n 
photons of frequency w 

5.5 Derive the commutator \ a i (?)* a t 001 for the creation and annihilation 
operators 

< (?) = (?) ~ l Pt <*)] and ^J <0 = ^ + ipi ^ 

5.6 Two-photon interference 

(a) Using data from the Hong-Ou-Mandel experiment shown in 

Fig. 5.10, estimate the width in time of the wavepacket of the single 
photon created by Spontaneous Parametric Down-Conversion. 

(b) What is the approximate real-space size of the single photon 
wavepacket when it intersects the beamsplitter? Comment on possible 
procedures to align two wavepackets so that they overlap? 

(c) What happens to the wavepacket alignment and the measurement 
when the beamsplitter position in Fig. 5.9 is scanned? How is this 
“walk-off” effect resolved in Fig. 5.11? 
Abstract 

Entanglement describes a situation in which two entities share a common 
attribute. The Einstein-Podolsky-Rosen ( EPR ) paradox concerns the behavior of 
entangled states. Their 1935 publication in Physical Review considered the 
measurements of properties on two systems which are first allowed to interact, 
and therefore become entangled, and which are subsequently separated so that 
they are no longer interacting with each other. In proposing this condition, EPR 
supposed locality. Bell’s theorem analyzes the outcomes of measurements at two 
different locations, A and B under the assumption of locality. Bell’s theorem 
allows for the presence of a “local hidden variable” that accounts for some or all 
of the apparent statistical nature of the measurements at locations A or B. The 
analysis places upper limits on the correlations that can be measured between 
results recorded at locations A and B, regardless of the nature or form of the 
hidden variable. For more than 2 decades, careful measurements of entangled 
photon pairs show significant violations of Bell’s limit. This result is widely 
accepted as demonstration that measurement of an entangled quantum does not 
depend entirely on local conditions where the measurement is made. The reality 
of the non-local behavior of entangled photon pairs is a foundation stone for 
quantum cryptography. This technology of ultra-secure communication is based 
on the distribution to authorized receivers of a quantum key that enables reading 
of encrypted messages. The quantum key is determined by the detection of 
photons correlated by entanglement at two different locations. 


6.1 Introduction 

Why did Marguerite fall in love with Faust? Faust, according to legend, was a 
scientist and philosopher in full mid-life crisis. His story has been treated by 
many playwrights and composers. However, Goethe’s early nineteenth-century 
treatment stands out among all others. Johann Wolfgang von Goethe is a 
colossus of German literature, and his impact on western European culture is 
truly profound. His literary output of plays, poetry, novels, essays, travelogues, 
and scientific texts fills more than 100 volumes, including Theory of Colours, 
published in 1810 and still in print today (von Goethe 2006). 

In Goethe’s Faust, Marguerite, a chaste, devout and beautiful young woman, 
falls hopelessly in love with Dr. Faust, an aging and bitter scientist seeking to 
recover his lost years. Their lives become entwined, and things go rapidly 
downhill as a result. Why would Marguerite fall for such a reprobate? She does 
not understand herself. However, the audience knows that there is a hidden 
variable: Mephistopheles, a representative of Satan with supernatural powers. 
Mephistopheles controls the relationship between Marguerite and Faust. As a 
result, Marguerite’s free will to decide is sacrificed as part of a deal between 
Faust and the devil. An important theme in Faust concerns free choice. 

Consider the following simple experiment: There are 2 subjects and a bag 
with an equal number of black and white balls inside. The balls are wrapped so 
their color cannot be seen. Subject no. A reaches inside the bag and picks a ball. 
His chosen ball has a definite color, but he does not know what it is. The bag is 
then taken to another room where subject no. B is waiting. She reaches into the 
bag and takes a second ball. After the selection is complete, subject no. A first 
unwraps his ball and notes the color. Then without any communication, subject 
no. B does the same. 

After several repetitions of this experiment, analysis shows that subject no. A 
chooses a black ball 50% of the time, as expected. The same is observed for 
subject no. B. However, when taken together, there is a strong correlation 
between the results of the two subjects. When subject no. A chooses a black ball, 
the chances are 2 to 1 that subject no. B chooses a white one. And, the reverse is 
also true. When subject no. A chooses a white ball, then the chances are 2 to 1 
that subject no. B will choose a black ball. 

How can the choice made by subject no. A so strongly influence the choice 
made by subject no. B when he does not know what color of ball he has chosen, 
and there is no path for communication between the two subjects? 



The explanation is that there is hidden information. There are only 4 balls in 
the bag: two black and two white. If we complete the description of the 
experiment with this additional information, then the statistical behavior of 
observed results is apparent. In quantum mechanics, the situation can be more 
mysterious. Two particles which are entangled quantum mechanically can be 
separated by significant spatial distance. A measurement (equivalent to looking 
at the color of a ball) made on one particle forces the second particle with 100% 
certainty to adopt a specific state. The question is whether a hidden variable 
causes this reaction to occur. Or, on the other hand, does the entangled pair 
behave like a quantum with a single wavefunction, so that wavefunction collapse 
on measurement of one of the particles forces the quantum into an eigenstate, 
determining instantaneously the states of both particles? 
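
The classical part of this argument can be checked by direct enumeration. The sketch below lists every equally likely ordered pair of draws from a bag of two black and two white balls and computes the individual and conditional frequencies quoted above. The function and variable names are illustrative only.

```python
from fractions import Fraction
from itertools import permutations

# The bag described in the text: two black and two white balls.
BAG = ["black", "black", "white", "white"]


def draw_statistics(bag):
    """Enumerate every ordered pair (A's ball, B's ball) drawn without replacement."""
    draws = list(permutations(range(len(bag)), 2))  # equally likely index pairs
    total = len(draws)
    a_black = [d for d in draws if bag[d[0]] == "black"]
    b_black = [d for d in draws if bag[d[1]] == "black"]
    a_black_b_white = [d for d in a_black if bag[d[1]] == "white"]
    return {
        "P(A black)": Fraction(len(a_black), total),
        "P(B black)": Fraction(len(b_black), total),
        "P(B white | A black)": Fraction(len(a_black_b_white), len(a_black)),
    }


stats = draw_statistics(BAG)
for name, value in stats.items():
    print(name, "=", value)
```

Each subject sees black half of the time, while the conditional probability of 2/3 corresponds to the odds of 2 to 1 described in the text. No communication is needed, only knowledge of the contents of the bag.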


6.2 The Einstein, Podolsky, Rosen Paradox: Is the 
Description of Reality by Quantum Mechanics 
Complete? 

6.2.1 Entangled States in Quantum Mechanics 

In quantum mechanics two particles become entangled when they share a 
common measurable attribute like spin or polarization (see Sect. 2.5). 

Although the entangled pair consists of two particles, under certain 
measurements they behave like a single entity. Suppose we have an entangled 
electron-positron pair. The measurement of the spin of the electron, by having it 
pass between the poles of a magnet, will also determine the spin of the positron, 
even though it may be spatially far away. After the measurement, the particles 
are no longer entangled, because each is now in a well-defined spin state. 

We can express the wavefunction of a pair of particles as $|\Psi_{1+2}\rangle$. In certain 
cases, we can write this wavefunction as the tensor product of the wavefunctions 
$\psi_1$ and $\psi_2$ of the two particles that make up the pair. That is: 

$|\Psi_{1+2}\rangle = |\psi_1\rangle \otimes |\psi_2\rangle$   (6.1) 

In this case the wavefunction of the pair can be factored into 2 pure states. 

Measurements on particle 1 can be made without affecting particle 2. 

However, (6.1) is only one possibility. In general, the wavefunction of a pair 
of particles is a superposition of states which can be factored only under special 
circumstances. For example, the state: 

$|\Psi_{1,2}\rangle = \frac{1}{\sqrt{2}}\left(|+\rangle_1 |+\rangle_2 + |-\rangle_1 |-\rangle_2\right)$,   (6.2) 

which could represent the spin of an electron-positron pair, cannot be 
factored into 2 pure states. As a result this is called an entangled state. 
On the other hand, the state: 



(6.3) 


can be factored into pure states of the individual particles, so it is not 
entangled. 
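
For two two-level particles, factorability can be tested with a simple determinant. This is a minimal sketch under the assumption that the state is written with four amplitudes on the basis |+>|+>, |+>|->, |->|+>, |->|->. The function name and the example product state are hypothetical and are not taken from the text.

```python
import math


def is_factorable(c_pp, c_pm, c_mp, c_mm, tol=1e-12):
    """True if c_pp|+>|+> + c_pm|+>|-> + c_mp|->|+> + c_mm|->|-> is a product state.

    A state of two two-level particles factors into
    (a|+> + b|->) (x) (c|+> + d|->) exactly when the 2x2 matrix of
    amplitudes [[c_pp, c_pm], [c_mp, c_mm]] has zero determinant.
    """
    return abs(c_pp * c_mm - c_pm * c_mp) < tol


s = 1 / math.sqrt(2)

# State (6.2): (|+>|+> + |->|->)/sqrt(2) cannot be factored.
print(is_factorable(s, 0, 0, s))

# Hypothetical product state chosen for illustration: |+> (x) (|+> + |->)/sqrt(2).
print(is_factorable(s, s, 0, 0))
```

The determinant test works because a product state has amplitudes ac, ad, bc, bd, for which the determinant vanishes identically.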

The Einstein-Podolsky-Rosen ( EPR ) paradox concerns the behavior of 
entangled states (Fine 2015). Their 1935 publication in Physical Review 
considered the measurements of properties on two systems which are first 
allowed to interact, and therefore become entangled, and which are subsequently 
separated so that they are no longer interacting with each other (Einstein et al. 
1935). In proposing this condition, EPR supposed locality. That is, 
measurements on system I cannot affect the state of system II. In particular, the 
state of system II cannot be affected simultaneously by measurements on system 
I. 

EPR states: In attempting to judge the success of a physical theory, we may 
ask ourselves two questions: (1) “Is the theory correct?” and (2) “Is the 
description given by the theory complete?” It is only in the case in which 
positive answers may be given to both of these questions, that the concepts 
of the theory may be said to be satisfactory (Einstein et al. 1935). 

This statement is the equivalent of the concept of logical proof, both 
necessary and sufficient. 

EPR states next: Every element of the physical reality must have a 
counterpart in the physical theory. We shall call this the condition of 
completeness. The elements of the physical reality must be found by an 
appeal to results of experiments and measurements. 

The analysis of EPR considers a wavefunction that describes the entangled 
state of the two systems: 

$\Psi(x_1, x_2) = \sum_{n} \psi_n(x_2)\, u_n(x_1)$   (6.4) 

where $x_1$ are the variables that describe system I and $x_2$ are the variables to 
describe system II. The $u_n(x_1)$ are the eigenfunctions that correspond to the 
measurement of some property A of system I. After a measurement of A, 
eigenvalue $a_k$ is found and system I collapses to the single state $u_k(x_1)$. The 
expression for $\Psi(x_1, x_2) = \psi_k(x_2)\, u_k(x_1)$ has simplified to a single term and the 
wavefunction for system II is $\psi_k(x_2)$. If we now make a second measurement on 
system I of a different property B, having a different set of eigenfunctions $v_s(x_1)$ 
and eigenvalues $b_s$, then we can write 

$\Psi(x_1, x_2) = \sum_{s} \varphi_s(x_2)\, v_s(x_1)$   (6.5) 

After making the measurement on system I, we find the eigenvalue $b_r$ and 
the system is in the eigenstate $v_r(x_1)$. As a result, system II is now found to be in 
a single state $\varphi_r(x_2)$. EPR note the contradiction, that a measurement of different 
properties of system I results in changing the state of system II, even though it is 
postulated that these systems are not interacting. They conclude that it is 
possible to assign two different wavefunctions to describe the same reality 
(system II), and that this result suggests that the quantum mechanical description 
is not complete. A better conclusion would be that the analysis of EPR is rather an 
elegant demonstration of the non-local nature of entangled systems. 

Fifteen years later, David Bohm illustrated the situation by a simple, but 
concrete model experiment involving an entangled electron-positron pair (Bohm 
1951, 1989). In this model a source emits an electron-positron pair whose global 
magnetic moment is zero. The electron proceeds from the source towards 
measurement station A where there is an analyzing magnet and detector. The 
positron proceeds in the opposite direction toward station B where there is a 
similar measurement apparatus. This model is shown schematically in Fig. 6.1. 


Fig. 6.1 Schematic diagram of the EPR-Bohm model experiment that illustrates non-local behavior of an 
entangled pair of particles. The source emits entangled electron-positron pairs, which travel to analyzers at 
stations A and B. The spin of the particle pair is zero. While such an experiment is simple to imagine, its 
actual realization is difficult, requiring high vacuum over substantial distances and other practical difficulties 


In the absence of a measurement, there is no way to know what the 
orientation of the electron spin is. In the Copenhagen interpretation of quantum 
mechanics the orientation of the electron spin is in a superposition of the two 
possibilities. The same statement can be made about the positron. However, we 
do know with certainty in this case that the magnetic moment of the electron is 
equal and opposite to the magnetic moment of the positron. 

When the electron passes through the magnetic field of the analyzing magnet 
at A, it is forced into an eigenstate with a distinct eigenvalue: either spin up 
($= |+1\rangle$) or spin down ($= |-1\rangle$). As a result of the wavefunction collapse, the 
positron must adopt a distinct eigenstate with its spin oriented opposite to that of 
the electron, i.e., spin down ($= |-1\rangle$) or spin up ($= |+1\rangle$) respectively. There may 
appear to be a cause and effect relationship between the measurement of electron 
spin at A and the determination of the positron spin at B. The causality 
requirement of special relativity would require that the measurement of electron 
spin at A could affect the positron spin at B no faster than the speed of light. 
However, wavefunction collapse is not a cause and effect relationship. It is an 
operation that affects the whole wavefunction, and not just the part that is 
localised at A (or B). Thus the collapse happens everywhere in space where the 
wavefunction is defined. 

The collapse of the wavefunction accomplishes the transition from the 
quantum environment (wavefunction) to the world of classical reality 
(eigenvalues). The collapse does not transmit information. Before the collapse 
we know already that the spin of the electron is opposite to the spin of the 
positron. At the moment that the electron spin is determined, nothing additional 
is learned about the spin of the positron, and so no information is transmitted. 
The correlation of the spin orientation between these two particles is not limited 
in quantum mechanical theory by the distance of their physical separation. A 
short reflection shows that the wavefunction collapse must be 
instantaneous. If it were any other way there could be serious problems of 
conservation of energy, momentum, and in some events like electron tunneling, 
even conservation of matter. 

For purely practical reasons, the EPR-Bohm experiment is difficult to carry 
out. One significant difficulty arises from having to put the apparatus in high 
vacuum to allow the particles to propagate. 1 There is a simple analogue to this 
idea using photons. A photon has two states of polarization (parallel and 
perpendicular) and an entangled pair of photons sharing an overall polarization 
can be created. In such a system, the determination through measurement of the 
polarization of one of the photons will instantaneously determine the 
polarization of the other so that the overall polarization of the pair remains 
unchanged. 


6.2.2 Measurements of Photon Polarization 2 

Suppose that we have 2 photon modes $\ell'$ and $\ell''$ characterized by the same 
wavevector k and frequency $\omega$, but with polarizations $\varepsilon'$ and $\varepsilon''$. The photon 
states are defined by the basis $\{|X\rangle; |Y\rangle\}$. We can define a coordinate system as 
shown in Fig. 6.2, with photon propagation along the O-z direction. 



Fig. 6.2 Measurement of photon polarization. One of the two detectors at the output of the polarising 
beam-splitter will measure a signal: “+1” is recorded when the polarization lies along the axis of the 
polarizer and “-1” is recorded when the polarization is perpendicular to the axis of the polarizer. As 
demonstrated in Chap. 2, for a single photon, no other results are possible. Thus the measurement forces the 
photon to adopt a polarization along one of the two directions 


We will define $\theta$ as the angle between the O-x axis and the axis of the 
polarizer. Next, introduce $E(\theta)$ as the operator for the polarization measurements 
along $\theta$ and $\theta + \frac{\pi}{2}$. When the polarization analyser axis is aligned along O-x, 

$E(0) = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$   (6.6) 

The polarizer can be rotated freely around the z-axis. When the polarization 
analyser is oriented at $\theta$ from O-x, 

$E(\theta) = \begin{pmatrix} \cos 2\theta & \sin 2\theta \\ \sin 2\theta & -\cos 2\theta \end{pmatrix}$   (6.7) 

The eigenvectors of $E(\theta)$ are: 

$|+\theta\rangle = \cos\theta\,|X\rangle + \sin\theta\,|Y\rangle$ 
and 
$|-\theta\rangle = -\sin\theta\,|X\rangle + \cos\theta\,|Y\rangle$   (6.8) 

Now assume a photon with a polarization at $\lambda$ relative to O-x. Its 
wavefunction can be written: 

$|\psi\rangle = \cos\lambda\,|X\rangle + \sin\lambda\,|Y\rangle$   (6.9) 

The probability of detecting a “+1” is expressed 

$P_{+}(\theta, \lambda) = |\langle +\theta\,|\,\psi\rangle|^2 = |\cos\theta\cos\lambda\,\langle X|X\rangle + \sin\theta\sin\lambda\,\langle Y|Y\rangle|^2$ 

$= |\cos\theta\cos\lambda + \sin\theta\sin\lambda|^2$ 

$P_{+}(\theta, \lambda) = \cos^2(\theta - \lambda)$   (6.10) 

Similarly, the probability for detecting a “-1” is 

$P_{-}(\theta, \lambda) = \sin^2(\theta - \lambda)$   (6.11) 

Equations (6.10) and (6.11) are known as Malus’ Law 
(https://en.wikipedia.org/wiki/Polarizer). 
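
The single-photon results of this section can be checked numerically. The sketch below builds the analyzer matrix of (6.7), confirms that the two analyzer states are eigenvectors with eigenvalues +1 and -1, and compares the “+1” probability with Malus’ law. The function names and the two example angles are illustrative assumptions.

```python
import math


def analyzer_matrix(theta):
    """Polarization operator E(theta) of (6.7) in the {|X>, |Y>} basis."""
    c, s = math.cos(2 * theta), math.sin(2 * theta)
    return [[c, s], [s, -c]]


def apply(matrix, vec):
    """Multiply a 2x2 matrix by a 2-component vector."""
    return [matrix[0][0] * vec[0] + matrix[0][1] * vec[1],
            matrix[1][0] * vec[0] + matrix[1][1] * vec[1]]


def p_plus(theta, lam):
    """Probability of '+1': squared overlap of |+theta> with the photon state."""
    plus_state = [math.cos(theta), math.sin(theta)]
    psi = [math.cos(lam), math.sin(lam)]
    return (plus_state[0] * psi[0] + plus_state[1] * psi[1]) ** 2


theta, lam = math.radians(30), math.radians(75)   # illustrative angles

# |+theta> and |-theta> should be eigenvectors with eigenvalues +1 and -1.
plus = [math.cos(theta), math.sin(theta)]
minus = [-math.sin(theta), math.cos(theta)]
print(apply(analyzer_matrix(theta), plus), plus)
print(apply(analyzer_matrix(theta), minus), [-m for m in minus])

# Malus' law: P+ = cos^2(theta - lam).
print(p_plus(theta, lam), math.cos(theta - lam) ** 2)
```

In each printed pair the two entries should agree to within floating-point rounding.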


6.2.3 Joint Polarization Measurements 

Next we consider the ensemble polarization of a pair of photons. We can express 
the wavefunctions as: 

$|\psi_1\rangle = |X_1\rangle + |Y_1\rangle$ 

$|\psi_2\rangle = |X_2\rangle + |Y_2\rangle$ 

The joint polarization can be represented by a 4 x 4 matrix in the basis: 

$[\,|X_1, X_2\rangle;\ |X_1, Y_2\rangle;\ |Y_1, X_2\rangle;\ |Y_1, Y_2\rangle\,]$   (6.12) 

Fig. 6.3 A photon source emits pairs of entangled photons which propagate in different directions. They 
are detected at measurement stations A and B. Whenever a photon is detected at A, the other is detected at B. 
At station A there is a polarization analyzer, the axis of which is oriented at an angle a with respect to the 
axis O-x. At station B the orientation of the analyser is set at angle b with respect to the axis O-x. The 
measurement of polarization at each station gives either +1 (parallel to the polarizer axis) or -1 
(perpendicular to the polarizer axis) 

The joint probabilities are expressed as follows. For measuring polarization 
parallel relative to O-x (see Figs. 6.1 and 6.3) at both A and B: 

$P(+1, +1) = P_{++}(a, b) = |\langle +a, +b\,|\,\psi\rangle|^2$   (6.13) 

and for measuring polarization perpendicular to O-x at both A and B, 

$P(-1, -1) = P_{--}(a, b) = |\langle -a, -b\,|\,\psi\rangle|^2$   (6.14) 

The probability for measuring parallel polarization at A alone is: 

$P_{+}(a) = P_{++}(a, b) + P_{+-}(a, b)$   (6.15) 

The wavefunction 

$|\Psi_{EPR}\rangle = \frac{1}{\sqrt{2}}\left(|X, X\rangle + |Y, Y\rangle\right)$   (6.16) 

represents an entangled pair of photons with frequencies $\omega_1$ and $\omega_2$. The 
state $|X, X\rangle$ represents a photon $\omega_1$ polarised in the O-x direction and a photon 
$\omega_2$ also polarised in the O-x direction. The 2-photon state given by (6.16) is 
entangled because it cannot be factored into 2 terms, one associated with $\omega_1$ and 
the other associated with $\omega_2$. 

Now let us proceed to measurements on this state. The polarizer at A is set to 
a degrees relative to O-x, and the polarizer at B is set to b degrees relative to 
O-x. 

The joint probability of measuring a “+1” at both A and B is 

$P_{++}(a, b) = |\langle +a, +b\,|\,\Psi_{EPR}\rangle|^2 = \frac{1}{2}\cos^2(\theta_a - \theta_b) = \frac{1}{2}\cos^2(a, b)$   (6.17) 

and 

$P_{+-}(a, b) = |\langle +a, -b\,|\,\Psi_{EPR}\rangle|^2 = \frac{1}{2}\sin^2(\theta_a - \theta_b) = \frac{1}{2}\sin^2(a, b)$   (6.18) 

These results depend only on the difference between the angles of the axes of 
the 2 polarizers and are invariant under rotation about the z-axis. 

The probability of measuring parallel polarization at A (“+1”) regardless of 
what happens at B is: 

$P_{+}(a) = P_{++}(a, b) + P_{+-}(a, b) = \frac{1}{2}\left(\cos^2(a, b) + \sin^2(a, b)\right) = \frac{1}{2}$   (6.19) 

Similarly the probability of measuring a perpendicular ($\perp$) polarization at A is also $\frac{1}{2}$. That 
is, taken separately, each set of polarization measurements at A and at B gives 
the expected result for a random variable. The results appear to be independent, 
yet there is strong correlation, as we will now see. 

For this EPR state, the conditional probability of finding “+1” at B if “+1” 
has been found at A is: 


$\mathrm{pr}(B \text{ given } A) = \frac{\mathrm{pr}(B \text{ and } A)}{\mathrm{pr}(A)} = \frac{P_{++}(a, a)}{P_{+}(a)} = \frac{\frac{1}{2}\cos^2(\theta = 0)}{1/2} = \frac{1/2}{1/2} = 1$   (6.20) 

These results reveal a fundamental property about measurements of 
correlated particles. Suppose you are operating a laboratory measuring photons 
reaching you from a secure communication channel. You are told that some 
photons have random polarization and others are entangled photon pairs with 
polarization pre-determined by a measurement at the other end of the channel. 
Equation (6.19) tells you that you cannot do a measurement on the photons that 
distinguishes one from the other. On the other hand, (6.20) says that you can 
confirm that a photon pair is correlated, but you must have access to 
measurements performed on each photon. 
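
The joint, marginal and conditional probabilities of (6.17) to (6.20) can be reproduced by projecting the EPR state onto the analyzer eigenstates. This is a minimal sketch with real amplitudes. The function names and the two analyzer settings are assumptions chosen for illustration.

```python
import math

S = 1 / math.sqrt(2)
# |Psi_EPR> = (|X,X> + |Y,Y>)/sqrt(2) on the basis [XX, XY, YX, YY].
PSI_EPR = [S, 0.0, 0.0, S]


def analyzer_state(angle, outcome):
    """Single-photon eigenvector |+angle> or |-angle> in the {|X>, |Y>} basis."""
    if outcome == +1:
        return [math.cos(angle), math.sin(angle)]
    return [-math.sin(angle), math.cos(angle)]


def joint_probability(a, b, out_a, out_b, psi=PSI_EPR):
    """P(out_a, out_b) = |<out_a a, out_b b | psi>|^2 for real amplitudes."""
    va, vb = analyzer_state(a, out_a), analyzer_state(b, out_b)
    product = [va[i] * vb[j] for i in range(2) for j in range(2)]
    amplitude = sum(p * q for p, q in zip(product, psi))
    return amplitude ** 2


a, b = math.radians(20), math.radians(50)    # illustrative settings
p_pp = joint_probability(a, b, +1, +1)
p_pm = joint_probability(a, b, +1, -1)
print(p_pp, 0.5 * math.cos(a - b) ** 2)      # compare with (6.17)
print(p_pm, 0.5 * math.sin(a - b) ** 2)      # compare with (6.18)
print(p_pp + p_pm)                           # marginal at A, compare with (6.19)

# Conditional probability of "+1" at B given "+1" at A, same orientation (6.20).
print(joint_probability(a, a, +1, +1) / 0.5)
```

The marginal at A carries no trace of the setting at B, while the conditional probability reveals the correlation once both records are compared.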


6.3 Bell’s Theorem 

6.3.1 John S. Bell and the Completeness of Quantum 
Mechanics 

So far we have examined the behavior of entangled particles. Measurements on 
entangled pairs of particles show that the result of measurements taken on one 
particle is determined by the state of the second particle, even though it may be 
too far distant to communicate that state at the speed of light. Einstein referred to 
this situation as “spooky action at a distance”, apparently in violation of the 
theory of special relativity. For several decades, the EPR paradox gathered dust 
because the apparatus needed to demonstrate the experiments did not exist, and 
there was much more exciting work to be done on elementary particle physics. 
John Bell changed all this. 

John S. Bell was a physicist, born in Belfast and educated in the UK, earning 
a Ph.D. from the University of Birmingham. He spent most of his career working 
at CERN on particle accelerator design and theoretical particle physics. One of 
his hobbies was a better understanding of the basics of quantum theory. You 
might compare his situation to that of Einstein working for the Swiss patent 
office, and doing special relativity on the side. In 1964, he took a year-long 
sabbatical, the result of which was the development of an inequality relationship 
now known as Bell’s theorem. Bell’s analysis considers measurements having a 
statistical result. These measurements could be on classical macroscopic systems 
or on quantum mechanical particles. The analysis compares measurements made 
at two different locations: A and B (cf. Fig. 6.4). Of course, the context that 
interested Bell is quantum mechanical measurements, because in specific 
circumstances, measurements on quanta can show correlations that exceed the 
upper limit of Bell’s inequality. 
Fig. 6.4 Schematic diagram of an experiment to measure polarization of entangled photon pairs. Four 
different measurement stations are used. The acousto-optic modulators in each arm permit switching of the 
photon path to different polarization analyzers. In the development of Bell’s theorem on the limits of hidden 
variables to affect correlation between particles, 4 different measurement combinations are required 


Bell was also interested in the probabilistic characteristic of quantum 
measurements. A hidden variable (a phenomenon, another particle, or 
information like the number of balls in the bag) could be responsible for some or 
all of the seemingly statistical nature of the measurement results. (Recall from 
Chap. 1 that Pauli proposed the existence of a nearly invisible particle, the 
neutrino, to account for seemingly random fluctuations in energy and momentum 
during beta decay.) Bell shows us that measurements 
carried out independently in different locations could be correlated, but that there 
are upper limits on the possible effect of the correlation. Bell’s theorem shows 
that this limit exists for any hidden variable. This result is an awe-inspiring 
intellectual achievement. 

Bell’s theorem states that any physical theory based on local reality cannot 
reproduce all the predictions of quantum mechanical theory. 

Thus, completing quantum mechanics by addition of more detailed theory 
cannot explain the EPR paradox if we insist that each measurement depend only 
on local settings of the measuring equipment, and be thus independent of the 
situation where the second particle is being measured. Bell’s theorem sparked 
renewed interest in the EPR paradox, and five years later in 1969 Clauser, 
Horne, Shimony and Holt (CHSH) (Clauser et al. 1969) published a proposal to 
make the measurements by photonics, using entanglement of the polarization of 
photon pairs emitted in an atomic cascade. Implementation of the measurements 
requires sensitive equipment, like single-photon counters that were still under 
development. Initial attempts showed promising results. More than 10 years 
later, Aspect et al. (1981) carried out experiments based on the proposal of 
CHSH which showed strong violation of Bell’s inequalities, thus demonstrating 
that in specific experiments, behavior of entangled quanta cannot be explained 
by theories of local interactions, even those incorporating local hidden variables. 


6.3.2 Hidden Variables in Quantum Mechanics 

We will consider measurements made on pairs of entangled quantum particles, 
such as photons which have been separated spatially. 

As we have shown in Chap. 2, when a single photon passes through a 
beam-splitting polarizing cube, because it is a quantum, there are only 2 possible 
results: parallel polarization or perpendicular polarization. 

Let $A(a)$ be a function that gives the result of a measurement made at station 
A in Fig. 6.4 using the polarizing cube with its axis oriented in direction a with 
respect to a common reference. Then $A(a) = \pm 1$ for the orientation parallel or 
perpendicular to a. 





First of all, we assume that the experimenter has the freedom to choose 
polarizer orientation, and other experimental conditions that prevail at A, 
independently from those at location B. The key element of Bell’s analysis is the 
assumption of locality. Thus, measurements $A(a)$ are independent of $B(b)$, and 
the result $E(a, b)$ of a joint measurement is the simple product: 

$-1 \le E(a, b) = A(a)\,B(b) \le 1$   (6.21) 

Now, let us assume that there is a local hidden variable $\lambda$: a parameter or 
function or operator for which the only function is to give preferential weight to 
particular values of orientation a or b. $\lambda$ can therefore change the conditions 
under which $A(a) = 1$ or -1, but not those values which are imposed by the 
quantum nature of the particles. That is, relationship (6.21) is not changed: 

$-1 \le E(a, b) = A(a, \lambda)\,B(b, \lambda) \le 1$   (6.22) 

The result of the joint measurement, taking into account the effect of the 
hidden variable, is 

$E(a, b) = \int d\lambda\, \rho(\lambda)\, A(a, \lambda)\, B(b, \lambda)$ 

where: 

$-1 \le E \le 1$ 

$A(a, \lambda) = \pm 1$   (6.23) 

$B(b, \lambda) = \pm 1$ 

and 

$\int d\lambda\, \rho(\lambda) = 1$ 

Each measurement can be made only once on a given pair of particles, and 
$E(a, b)$ represents a single experiment with orientations a, b, while $E(a, c)$ 
represents a different experiment using a different pair of particles, totally 
independent of the previous pair, and using a different choice for the orientation 
of the polarising cube at B. 

Let’s look at the difference in the results between these two experiments: 


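$E(a, b) - E(a, c) = \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(b, \lambda) - A(a, \lambda) B(c, \lambda)\right]$   (6.24) 

$E(a, b) - E(a, c) = \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(b, \lambda)\right] - \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(c, \lambda)\right]$   (6.25) 

Next we can expand this integral by adding and subtracting the same term: 
$A(a, \lambda) B(b, \lambda) A(d, \lambda) B(c, \lambda)$. This enables the factoring out of some common 
elements. 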


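$E(a, b) - E(a, c) = \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(b, \lambda) \pm A(a, \lambda) B(b, \lambda) A(d, \lambda) B(c, \lambda)\right]$ 
$\qquad - \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(c, \lambda) \pm A(a, \lambda) B(b, \lambda) A(d, \lambda) B(c, \lambda)\right]$   (6.26) 

$E(a, b) - E(a, c) = \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(b, \lambda)\right]\left[1 \pm A(d, \lambda) B(c, \lambda)\right]$ 
$\qquad - \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(c, \lambda)\right]\left[1 \pm A(d, \lambda) B(b, \lambda)\right]$   (6.27) 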


By incorporation of an additional measurement $A(d, \lambda)$, we introduced 2 
additional experiments, and thus two additional pairs of particles, in order to take 
into account all the possible permutations of the orientations of the polarising 
cubes at A and B. 

Next, take the absolute value of each side. 


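$|E(a, b) - E(a, c)| = \Big|\int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(b, \lambda)\right]\left[1 \pm A(d, \lambda) B(c, \lambda)\right]$ 
$\qquad - \int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(c, \lambda)\right]\left[1 \pm A(d, \lambda) B(b, \lambda)\right]\Big|$   (6.28) 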

Use the “triangle inequality” to separate the RHS of the equation into two 
terms. Notice that the sign changes to a “+” sign. 


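$|E(a, b) - E(a, c)| \le \Big|\int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(b, \lambda)\right]\left[1 \pm A(d, \lambda) B(c, \lambda)\right]\Big|$ 
$\qquad + \Big|\int d\lambda\, \rho(\lambda)\left[A(a, \lambda) B(c, \lambda)\right]\left[1 \pm A(d, \lambda) B(b, \lambda)\right]\Big|$   (6.29) 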

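Replace the products $A(a, \lambda) B(b, \lambda)$ and $A(a, \lambda) B(c, \lambda)$ by their maximum 
values, i.e.: 1. This substitution does not change the inequality 

$|E(a, b) - E(a, c)| \le \Big|\int d\lambda\, \rho(\lambda)\left[1 \pm A(d, \lambda) B(c, \lambda)\right]\Big| + \Big|\int d\lambda\, \rho(\lambda)\left[1 \pm A(d, \lambda) B(b, \lambda)\right]\Big|$   (6.30) 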


It can be seen that $\left[1 \pm A(d, \lambda) B(c, \lambda)\right]$ is never negative, lying 
between 0 and 2. The same goes for $\left[1 \pm A(d, \lambda) B(b, \lambda)\right]$. 

As a result, we can take away the two absolute value operations on the RHS: 

$|E(a, b) - E(a, c)| \le \int d\lambda\, \rho(\lambda)\left[1 \pm A(d, \lambda) B(c, \lambda)\right] + \int d\lambda\, \rho(\lambda)\left[1 \pm A(d, \lambda) B(b, \lambda)\right]$   (6.31) 

Now we can perform the integrations, using $\int d\lambda\, \rho(\lambda) = 1$. 

$|E(a, b) - E(a, c)| \le 2 \pm \left(E(d, c) + E(d, b)\right)$   (6.32) 

And finally: 

$|E(a, b) - E(a, c)| \pm \left(E(d, c) + E(d, b)\right) \le 2$   (6.33) 

We can rearrange and remove the absolute value operator 

$-2 \le E(a, b) - E(a, c) + E(d, c) + E(d, b) \le 2$   (6.34) 

This proves Bell’s theorem, in the form developed by Clauser et al. (1969). 

It describes the results of four measurements, each made on a different pair of 
particles using different orientations of the polarizing cubes at A: (a and d) and 
at B: (b and c), under the assumption of locality, that is: the measurements at A 
are independent of the measurements at B. It is apparent that the inequality in 
(6.34) can be evaluated in experiment. We will refer to the expression: 

$S = E(a, b) - E(a, c) + E(d, c) + E(d, b)$   (6.35) 

as the Bell function. 
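
The bound of 2 can also be checked by brute force, using an argument that differs from the integral derivation above but is commonly used for the same inequality. For any fixed value of the hidden variable, each of the four local results is predetermined as +1 or -1, so there are only 16 cases to enumerate. The function name below is illustrative.

```python
from itertools import product


def bell_function(A_a, A_d, B_b, B_c):
    """S = E(a,b) - E(a,c) + E(d,c) + E(d,b) for one fixed value of the hidden variable."""
    return A_a * B_b - A_a * B_c + A_d * B_c + A_d * B_b


# For a fixed hidden variable, each local result is predetermined as +1 or -1.
values = [bell_function(*outcomes) for outcomes in product([+1, -1], repeat=4)]
print(sorted(set(values)))

# Averaging over any normalized distribution rho(lambda) cannot leave [-2, 2].
assert all(-2 <= s <= 2 for s in values)
```

Since every deterministic assignment gives S equal to -2 or +2, any weighted average over the hidden variable stays between these limits, in agreement with (6.34).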


6.3.3 Experiments 

In order to perform experiments that evaluate (6.35) we need to specify the
settings of the polarizer axes. Typically the polarizers are set at equal angular
steps from one another, all measured relative to the O-x axis:

$$\theta_a = 0, \quad \theta_b = \varphi, \quad \theta_d = 2\varphi, \quad \theta_c = 3\varphi$$

Then it follows that

$$\angle(a,b) = \varphi, \quad \angle(d,c) = \varphi, \quad \angle(d,b) = \varphi, \quad \angle(a,c) = 3\varphi \tag{6.36}$$

The next step is to specify the form of $E(a,b)$.

We will first consider Bell’s own hidden variable model. 

In this case, the polarization of the two photons in a given pair depends on a
parameter $\lambda$ and the measurement of polarization depends on this common
parameter. Then it is easy to see that there will be a correlation between the
measurements at A and B. We suppose that the two photons have a well-defined
polarization $\mathbf{E}$ in the x-y plane that is specified as the angle $\lambda$ between the O-x
axis and the polarization direction. The orientation of $\mathbf{E}$ changes from pair to
pair in a random manner, and so $\lambda$ is a random variable equally distributed over
all angles, $0 \le \lambda \le 2\pi$; that is, $\rho(\lambda) = 1/2\pi$.

The polarizer at A is oriented at angle $\theta_a$ with respect to the O-x axis. If the
absolute value of the angle between the polarization and the polarizer, $|\theta_a - \lambda|$, is
less than or equal to $\pi/4$, the polarizer measurement gives “+1”. If $\pi/4 < |\theta_a - \lambda| \le \pi/2$,
the polarizer gives “-1”. We can model the polarizer response at A using the sign
function:

$$A(\lambda, a) = \operatorname{sign}\left[\cos\left(2(\theta_a - \lambda)\right)\right] \tag{6.37}$$

where $\operatorname{sign}(x) = +1$ if $x \ge 0$ and $\operatorname{sign}(x) = -1$ if $x < 0$.

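A similar argument applies at B:

$$B(\lambda, b) = \operatorname{sign}\left[\cos\left(2(\theta_b - \lambda)\right)\right]$$

Then we can express the expectation value of a joint measurement as

$$E(a,b) = \int d\lambda\,\rho(\lambda)\,A(\lambda,a)\,B(\lambda,b) = 1 - \frac{4\,|\theta_a - \theta_b|}{\pi} \tag{6.38}$$

for $-\pi/2 \le \theta_a - \theta_b \le \pi/2$.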

We show the expectation value for the result of a joint measurement in 
Fig. 6.5. 




Angle between photon polarization and polarizer axis 


Fig. 6.5 Expected value of a joint measurement of polarizations of a photon pair. Measurement results at A
are completely locally determined, but dependent on a hidden parameter $\lambda$. A similar situation prevails at
B


The Bell function for this model can be evaluated assuming the equal angle
settings of the polarizer-analyser:

$$S_{\text{hidden-variable}} = E(a,b) - E(a,c) + E(d,c) + E(d,b) = 2 \tag{6.39}$$

independent of the setting angle of the polarizers (provided that the largest
angle, $3\varphi$, stays within the range of validity of (6.38)), indicating a high degree of
correlation between the measurements at A and B. This is the absolute maximum
value of the parameter that is allowed by Bell’s Theorem. While Bell’s hidden
parameter model may seem crude, Bell’s theorem shows that one cannot do
better.
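
A numerical check of (6.37) to (6.39) is sketched below. It is an illustration under stated assumptions rather than part of Bell's argument: the hidden variable is taken as uniformly distributed over a full turn, sign(0) is taken as +1, the integral over the hidden variable is replaced by an average over equally spaced samples, and the function names, sample count and the two test angles are our own choices.

```python
import math

def sign(x):
    # Convention assumed here: +1 for x >= 0, -1 otherwise.
    return 1 if x >= 0 else -1

def E_hidden(theta_a, theta_b, n=10000):
    """Average of A*B over n equally spaced hidden-variable angles in [0, 2*pi)."""
    total = 0
    for k in range(n):
        lam = 2 * math.pi * (k + 0.5) / n
        A = sign(math.cos(2 * (theta_a - lam)))
        B = sign(math.cos(2 * (theta_b - lam)))
        total += A * B
    return total / n

def S_hidden(phi, n=10000):
    """Bell function for equal-angle settings a=0, b=phi, d=2*phi, c=3*phi."""
    a, b, d, c = 0.0, phi, 2 * phi, 3 * phi
    return (E_hidden(a, b, n) - E_hidden(a, c, n)
            + E_hidden(d, c, n) + E_hidden(d, b, n))

for phi in (math.pi / 12, math.pi / 8):
    print(phi, S_hidden(phi))  # both values are close to 2
```

For both angles tried here the sampled Bell function equals 2 to within the sampling error, which is the largest value the inequality (6.34) permits.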

Next we consider the quantum mechanical case. The polarization is not
determined until a detection takes place. The probabilities for detection are given
by Malus’ Law (6.10) and (6.11). The expectation value for a joint measurement
is expressed in the usual way:

$$E_{\text{q-m}}(a,b) = (+1)\,P_{+}(a,b) + (-1)\,P_{-}(a,b) = \cos^2(\theta) - \sin^2(\theta) = \cos(2\theta) \tag{6.40}$$

where $\theta$ is the angle between the axes of the two polarizers.

In Fig. 6.6 we show how the expectation value for joint detection varies as a 
function of angles between the axes of the two polarization analysers. 







Fig. 6.6 Expectation value for the joint polarization measurement based on quantum-mechanical behavior
of the photons in the pair. The result for Bell’s hidden variable model is shown for comparison. At certain
angles between the polarizers (0°, 45°, 90°, etc.) the two models give the same result, but elsewhere, the
quantum mechanical model gives an expectation value of larger magnitude


In order to evaluate the Bell function, we must specify the angle between the 
polarizers. This choice is up to the experimenter. We will use the convention of 
equal angles as discussed earlier. The behavior of the Bell Function is shown in 
Fig. 6.7. 



Fig. 6.7 Evaluation of the Bell function in the quantum mechanical case using equal angles between the
polarizers. The Bell function varies between 0 and $\pm 2\sqrt{2}$. Any excursion beyond the Bell limit of ±2
(shown as the shaded regions) implies that non-local interactions are influencing results. Note that there are
many choices for the angle separating the polarizers that do not result in a violation of Bell’s Theorem. The
setting that gives the largest deviation is $\varphi = \pi/8$


As an example, we make the choice that gives the most significant departure
from the limits imposed by Bell’s Theorem:

$$\angle(a,b) = \angle(d,c) = \angle(d,b) = \frac{\pi}{8} \tag{6.41}$$

and, of course, $\angle(a,c) = 3\pi/8$. Then

$$S_{\text{q-m}} = E(a,b) - E(a,c) + E(d,c) + E(d,b) = \tfrac{1}{2}\sqrt{2} + \tfrac{1}{2}\sqrt{2} + \tfrac{1}{2}\sqrt{2} + \tfrac{1}{2}\sqrt{2} = 2\sqrt{2} \tag{6.42}$$
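
The optimum angle can also be located numerically. The following sketch assumes the quantum mechanical correlation cos(2θ) of (6.40) and the equal-angle convention, in which three analyser pairs are separated by φ and the fourth by 3φ. The function names and the scan grid are our own choices for illustration.

```python
import math

def E_qm(theta):
    """Quantum mechanical correlation for analysers separated by angle theta."""
    return math.cos(2 * theta)

def S_qm(phi):
    """Bell function with (a,b) = (d,c) = (d,b) = phi and (a,c) = 3*phi."""
    return E_qm(phi) - E_qm(3 * phi) + E_qm(phi) + E_qm(phi)

# Scan phi from 0 to pi/2 in steps of 0.1 degree and keep the largest S.
grid = [k * math.pi / 1800 for k in range(901)]
best_phi = max(grid, key=S_qm)

print(math.degrees(best_phi))  # about 22.5 degrees, i.e. pi/8
print(S_qm(best_phi))          # about 2.828, i.e. 2*sqrt(2)
```

The scan reproduces the setting of (6.41) and the value of (6.42), and shows that S returns to 2 at φ = 0, where the quantum and hidden-variable results coincide.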


6.4 More About Bell’s Theorem: Loopholes, Bohm-de Broglie and Free-Will

Bell’s Theorem is remarkable and occupies perhaps a unique place in physics 
because it does not describe the behavior of a particle or even a pair of particles. 
It describes an environment, and a means to test if one is working within this 
environment. In order to evaluate (6.34), at least four separate independent 
experiments should be carried out. The result of such a test does not describe the 
nature of the entanglement of individual photon pairs. Indeed, a typical
experiment to test Bell’s inequality involves millions of measurements on
pairs emitted by a suitable source. The results are combined using the recorded
orientation of the polarizer in the arm where the photon is detected, and the
evaluation of (6.34) is calculated for the entire sequence of measurements.
Published reports of such experiments give the statistical analysis in terms of the
total average value of the Bell function and the width of the distribution in order
to establish the margin of error (see Fuwa et al. 2015; Kwiat et al. 1999). An
example is the report on photon pairs transmitted in separate optical fibre
circuits using the telecoms network of the Swiss postal service:

A test of Bell inequalities by photons 10.9 km apart is presented.
Energy-time entangled photon pairs are measured using two-channel analyzers,
leading to a violation of the inequalities by 16 standard deviations (Tittel et
al. 1998).




Entangled quantum particles are not exceptional. From atoms to the photon 
pairs considered in this chapter, entanglement is rather the rule. In most cases, 
entanglement implies some delocalization in space. Photon pairs that are 
separated by a beam-splitter move apart from each other at the speed of light, 
leading to significant and macroscopic spatial separation. 

Detection of one of the quanta forces it to adopt an eigenstate with 
measurable eigenvalues. However, in the case of an entangled state, the
measurement made on one quantum is actually made on both quanta that 
constitute the state. Both photons are thus forced into eigenstates with 
eigenvalues, simultaneous with the measurement. If it happened any other way, 
it would not be possible to normalize the wavefunction during the time necessary 
for both particles to adopt their appropriate eigenstate. The EPR paradox is not a 
paradox if one accepts that entanglement of quanta introduces a wavefunction 
for the corresponding state that is distributed in space, and that wavefunction 
collapse is necessary to conserve the normalization of that wavefunction. 

Bell’s theorem shows that a local hidden variable theory might explain some,
but not all, of the behavior of entangled quantum particles. Figure 6.7 shows that
there is a restricted set of experimental conditions which identifies the presence
of entanglement. Quantum mechanics, on the other hand, successfully predicts
observations, but takes into account the non-local behavior of entangled quanta.
Experiments that violate Bell’s inequality reveal that entangled pairs of quanta 
are described by a wavefunction that is distributed in space. Thus, analysis based 
only on local settings will not be able to describe all the observations. 

6.4.1 Bell Test Experiments—Loopholes 

With local hidden variables relegated to the sidelines, there remains a legitimate
question about non-local hidden variables and their effect on
experiments. This would be a different situation that lies outside the basic
premise underlying Bell’s theorem. An example of a non-local hidden variable
would be communication (at less than the speed of light) of polarizer settings
between stations A and B (see Larsson 2014).

Looking in more detail at an actual experiment shows important concerns, 
sometimes called loopholes, that need to be addressed if the results are to be 
considered a fair test of Bell’s inequalities. Aspect et al. (1982) performed one of 
the first measurements of Bell inequalities (http://journals.aps.org/prl/pdf/10. 
1103/PhysRevLett.49.1804) with close attention paid to closing the “locality 
loophole ”. Referring to Fig. 6.4, photon pairs are spatially separated and sent to 
the 2 analysis stations A and B. As in the experiments described in Chap. 2, The 




photons arrive at the detectors nearly simultaneously; Coincident detection at A 
and B within a chosen time interval signals the presence of entangled pairs. 

The locality loophole consists of communication of settings and results
between stations A and B prior to or during the measurement. To close the
loophole, it is necessary to set the two detection stations far enough apart that a
signal cannot be transmitted between them during the detection interval. In the
case of Aspect, Dalibard and Roger, that distance exceeds 13 m, corresponding
to a photon time-of-flight of 43 ns. The acousto-optic modulators switch the
photon pairs between the polarization detection stations a and d at A and b and c
at B at a time interval that is shorter than the photon transit time. These
modulators are unsynchronized so that the choice of which polarization detection
station actually receives the photon is close to random. Under these conditions,
communication of the detection settings between the stations, or with the
apparatus that generates the photons, is reduced to a minimum. On each side, the
orientation of the polarizer, as well as the results of polarization measurements,
is recorded as a function of time. After a sequence of measurements, the data
from the two sides are combined in order to calculate the Bell function for each
quartet of polarization measurements.
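
The bookkeeping in this last step can be sketched in a few lines of Python. This is a minimal illustration, not the analysis used by Aspect et al.: the function names and the coincidence counts are hypothetical, and each correlation is estimated as the average of the product of the +1/-1 outcomes, that is (N++ + N-- - N+- - N-+) divided by the total number of coincidences for that pair of settings.

```python
def correlation(n_pp, n_mm, n_pm, n_mp):
    """Estimate E from coincidence counts N++, N--, N+-, N-+ for one setting pair."""
    total = n_pp + n_mm + n_pm + n_mp
    return (n_pp + n_mm - n_pm - n_mp) / total

def bell_function(counts):
    """counts maps 'ab', 'ac', 'dc', 'db' to a tuple (N++, N--, N+-, N-+)."""
    E = {pair: correlation(*n) for pair, n in counts.items()}
    return E["ab"] - E["ac"] + E["dc"] + E["db"]

# Hypothetical coincidence counts, 1000 detected pairs per setting pair.
example_counts = {
    "ab": (427, 427, 73, 73),
    "ac": (73, 73, 427, 427),
    "dc": (427, 427, 73, 73),
    "db": (427, 427, 73, 73),
}

print(bell_function(example_counts))  # about 2.83 for these made-up counts
```

With real data the same sums are taken over the recorded time series, after the detection events on the two sides have been paired by their time stamps.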

In any experiment, the detection efficiency is less than 100%.
Photon pairs at A and B are only identified as belonging to a single pair after the
experiment is performed, by judging whether or not their detection times are
close enough to one another. If the signal-to-noise ratio is not too different from unity,
detection of a “noise” photon will occur in the coincidence window, leading to 
an ambiguous result. During post-experiment analysis, such results may be 
discarded from consideration, creating a coincidence loophole . This loophole 
can be minimized by narrowing the coincidence window and simultaneously 
increasing signal to noise by using a brighter photon source. The adoption of 
parametric down-conversion using BBO, which we introduced in Chap. 2 (see 
Fig. 2.16), has been an important contribution toward elimination of the 
coincidence loophole. A good demonstration is that of Kwiat and co-workers 
(http://arxiv.org/abs/quant-ph/9810003) (Kwiat et al. 1999). They have measured 
a Bell function (6.42) of S = 2.7, proving a violation of the Bell limit by over 
200 standard deviations. They generated entangled pairs of photons at 702 nm 
using parametric down-conversion in BBO. Nonetheless, their experiment did 
not close the locality loophole. The detection stations were not separated far 
enough spatially to preclude communication between detectors, and fast 
switching between polarizers was not used to ensure random selection of 
polarization orientation. 
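
The statistical statement quoted above for Kwiat and co-workers can be made concrete with a short calculation. The sketch below expresses a violation as the number of standard deviations by which the measured Bell function exceeds the local limit of 2. The function name and the uncertainty value are hypothetical, chosen only to be consistent with the figures quoted in the text.

```python
def violation_in_sigma(s_measured, sigma, bell_limit=2.0):
    """Number of standard deviations by which S exceeds the Bell limit."""
    return (s_measured - bell_limit) / sigma

# S = 2.7 as quoted in the text; sigma = 0.003 is a hypothetical uncertainty.
print(round(violation_in_sigma(2.7, 0.003)))  # 233
```

Read the other way round, a violation of more than 200 standard deviations at S = 2.7 implies an uncertainty on S smaller than 0.7/200 = 0.0035.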

Experiments that test Bell’s inequalities are relatively simple to describe, but
the devil is in the details. There are important loopholes in addition to those
described here, and experimenters continue to pursue a loophole-free test of
Bell’s inequality. The work of Hensen et al. (2015) cited earlier in this chapter
has succeeded in closing several important loopholes: locality, detection
efficiencies and fair sampling. This is an excellent achievement, but more work
remains to be done.

6.4.2 Locality, Reality and Free-Choice 

Equations (6.18)-(6.20) show that an observer at A cannot distinguish by a 
measurement on a single photon whether it belongs to an entangled pair or 
whether it is an uncorrelated quantum with random polarization. This photon 
will have a determined polarization if a measurement has previously been made 
on the other photon of the pair. This mental exercise suggests that there is a level 
of reality that quantum mechanics cannot describe. This situation poses a 
question: does quantum mechanics describe reality, as Einstein required, or does 
quantum mechanics describe the observer’s state of knowledge about reality? Is 
there a deterministic reality that we can’t see, with standard quantum mechanics 
describing our limited knowledge? 

A theory developed by David Bohm and Louis de Broglie treats the quantum 
as both a particle and a wave. The particle has well-defined eigenvalues: energy, 
momentum, position, polarization, spin, etc., just like a classical object, and is 
controlled by a wavefunction that determines how it can be observed. This 
theory reproduces all the results of standard quantum mechanics, including the 
fundamental uncertainty relationships. There is an important difference with 
respect to standard quantum mechanics. In the Bohm-de Broglie theory, the 
quantum does not reside in a “ghostly” superposition of eigenstates prior to 
measurement, with wavefunction collapse on measurement as postulated by the 
Copenhagen interpretation. Instead, an observable, such as polarization, always
has a defined value. Its evolution in space and time is governed by a
wavefunction, called a pilot wave. A good number of serious physicists,
including John Bell himself, have supported this theory as a viable alternative to 
conventional quantum mechanics. 

There are three key concepts that are present in the EPR question: Locality, 
Reality and Free-choice. From EPR to the development of a rigorous experiment 
by Aspect, the question of locality in quantum mechanics has been deeply 
scrutinized. Quantum entanglement introduces non-locality and the effect of this 
non-locality can be measured. To date, all experiments carried out under these
particular experimental conditions give results that cannot be explained by
local influences alone.


6.4.3 Free-Will 

Bell’s theorem and subsequent experiments show that there is no local hidden
variable that can reproduce all the results of quantum mechanical behavior.
However, a theory based on non-local hidden variables does not have this
limitation. The Bohm-de Broglie theory is a step in this direction. The concept of
non-local hidden variables leads quickly to the idea of pre-determined outcomes, not
in an individual laboratory, but in the universe at large. This brings the question
of free-will to the fore. Is the experimenter truly free to choose the apparatus and 
experiments to be performed, or are his actions influenced by conditions over 
which he has no knowledge or control? This is the situation that Goethe imposed 
on Marguerite. 

Bell addressed the dilemma of determinism in an interview with BBC radio 
in 1985. 

There is a way to escape the inference of superluminal speeds and spooky 
action at a distance, but it involves absolute determinism in the universe,
the complete absence of free will. Suppose the world is super-deterministic, 
with not just inanimate nature running on behind-the-scenes clockwork, but 
with our behavior, including our belief that we are free to choose to do one 
experiment rather than another, absolutely predetermined, including the 
‘decision’ by the experimenter to carry out one set of measurements rather 
than another, the difficulty disappears. There is no need for a faster-than- 
light signal to tell particle A what measurement has been carried out on 
particle B, because the universe, including particle A, already ‘knows’ what 
that measurement, and its outcome, will be. 4 

Shimony et al. (1976) have also addressed this issue: 

In any scientific experiment in which two or more variables are supposed to 
be randomly selected, one can always conjecture that some factor in the 
overlap of the backward light cones has controlled the presumably random 
choices. But, we maintain, skepticism of this sort will essentially dismiss all 
results of scientific experimentation. Unless we proceed under the
assumption that [superdeterminism does not hold], we have abandoned in 
advance the whole enterprise of discovering the laws of nature by 
experimentation. 5 


6.5 Summary 

Entanglement describes a situation in which two entities share a common 
attribute. In the case of two photons, this attribute could be polarization. While 
the net polarization of the pair of entangled photons is well-defined, the 
polarization of the individual photons is not. If the photons are physically 
separated by a beamsplitter, they can each be brought to a different 
measurement apparatus. However, the determination by measurement of the 
polarization of one photon will fix instantaneously and with certainty the
polarization of the second photon so that the net polarization is not altered by 
measurement. 

Measurement causes the entangled state wavefunction to collapse. Once the 
polarization of one photon is determined, then the polarization of the other is 
also known. This correlation between two photons has been shown experimentally
to exist over significant macroscopic distances, and provides evidence for
non-locality in physical measurements. That is, the measurement of a property of one
particle at location A is not completely determined by conditions at location A. 
The results at location A can also be affected by the results of a measurement of 
another particle at a different location B, provided that these particles are 
entangled. 

Bell’s theorem analyzes the outcomes of measurements at two different 
locations, A and B, under the assumption of locality. That is, the outcome at A is
determined only by the settings of the measurement equipment at location A, 
and the properties of the object to be measured at A. In this context, the theorem 
allows for the presence of a “local hidden variable” that accounts for some or all 
of the apparent statistical nature of the measurements at locations A or B. Bell’s 
analysis places upper limits on the correlations that can be measured between 
results recorded at locations A and B, regardless of the nature or form of the 
hidden variable. 

For more than two decades, careful measurements of entangled photon pairs
have demonstrated significant violations of Bell’s limit. We have shown that the
measured results can be explained by quantum mechanical analysis, and that the
assumption of locality is not valid. The polarization entanglement of a photon
pair can be described by a wavefunction that is non-local, spread over both
photons of the pair. Measurement of the polarization of one photon forces this
wavefunction to collapse into two disentangled states with distinct polarizations.

The demonstration of the non-local behavior of entangled photon pairs is a
foundation stone for quantum cryptography. This technology of ultra-secure
communication is based on the distribution to authorized receivers of a quantum
key that enables reading of encrypted messages. The quantum key is determined 
by the detection of photons correlated by entanglement at two different 
locations. Any attempt to intercept the photons that constitute the quantum key 
destroys the correlation, by wavefunction collapse. The cryptographic key is 
composed only of those detected photons for which the correlation is preserved. 
Using the quantum key, encrypted communications can be exchanged over 
conventional communication networks without having to worry about 
interception. 


Exercises 

6.1 Single photon detection can be achieved using a photomultiplier or a
silicon avalanche photodiode, both of which work best for visible and
near-ir photons. The quantum efficiency $\eta$ of these detectors is always less than
unity.

(a) Show that the expression for the Bell function is modified to:

$$2 - \frac{4}{\eta} \le E(a,b) - E(a,c) + E(d,c) + E(d,b) \le \frac{4}{\eta} - 2$$

(b) Presume that an experiment is carried out using entangled photons
with polarizers oriented at the optimum settings of $\pi/8$, $\pi/8$, $\pi/8$, $3\pi/8$. At what
level of quantum efficiency will it no longer be possible to detect a
violation of Bell’s inequality?
Footnotes


1 For example, in 2015, a group at the Delft University of Technology achieved an
elegant demonstration using a combination of entangled electron spins and photon polarisations
(Hensen et al. 2015). This is an area of active research and discovery.


2 This section is adapted from “The Naive View of an Experimentalist” by Aspect (2002).


3 This section is based on the demonstration and discussion by Bell (1987). 


4 Excerpt from the interview on BBC radio, conducted by Paul Davies. A transcript can be found in the 
book Davies and Brown (1993). 


5 Shimony et al. (1976), Bell et al. (1985). 
Abstract

LASER is an acronym for Light Amplification by Stimulated Emission of 
Radiation. Laser action is a general principle of the behavior of light absorption 
and emission by matter. A laser is an amplifier with positive feedback. 
Amplification is generated by stimulated emission of photons, and positive 
feedback is achieved using mirrors. The physics of spontaneous and stimulated 
emission of photons is directly seen through quantization of the electromagnetic
fields. The transition rate for stimulated emission depends on the number of 
photons already present in the electro-magnetic mode. Spontaneous emission can 
only be understood as a quantum reaction with the vacuum. Photons created by 
spontaneous emission act as seeds for subsequent stimulated optical transitions. 
Population inversion is needed to raise the stimulated emission rate above the 
rate of spontaneous emission. Optical gain will occur if the stimulated emission 
rate exceeds the stimulated absorption rate. Such a situation cannot occur in a 
two level system because the same two levels are responsible both for absorption 
and emission of photons. In a three or four level system, the equilibrium between 
absorption and emission is maintained, but the absorption takes place between 
one set of levels and the emission takes place between a different set. A critical 
parameter for laser performance is the threshold current density. The
double-heterostructure laser concept is based on confinement of the optical mode and
the recombination of electrons and holes to the same region of space.
Application of quantum confinement has dramatically improved this design by
separate optimization of the optical confinement region and the electron-hole
recombination region. This results in a steep reduction in the threshold current
and a gain factor that is independent of operating drive current.


7.1 Introduction 

LASER is an acronym for Light Amplification by Stimulated Emission of 
Radiation. Laser action is a general principle of the behavior of light absorption 
and emission by matter. As a result, lasing has been observed in a wide range of 
materials and conditions where luminescence is generated, including chemical 
reactions, antifreeze, gases, solids, liquids, and semiconductor p-n junctions. 
Even water can be made to support lasing in the far infrared. Semiconductor p-n
junctions are among the materials in which laser action can be achieved most 
easily. 

Semiconductor lasers cover a very wide range of optical wavelengths. Lasers 
can be built that span a range from less than 400 nm to more than 10,000 nm. No 
other materials system has this flexibility. Semiconductor lasers are relatively 
inexpensive. The cheapest examples sell for less than a dollar, and the most 
expensive for less than $10,000. As a result of these and other considerations, 
the semiconductor laser is by far the industry leader in terms both of the number
of units sold and the volume of revenues. Semiconductor laser diodes are the key 
component in a number of common devices such as a CD-player, scanner, 
printer, and DVD reader. They are also a key component in optical fiber 
telecommunications for generating the lightwave that travels down the fiber. 


7.2 Spontaneous and Stimulated Emission 

In this section we will consider optical absorption and emission in a finite 
resonant cavity of length L, such as the active region of a semiconductor laser. 

To a good approximation, the electric and magnetic fields can be considered as 
plane waves. The direction of propagation is taken along the z-axis, the magnetic 
field along the x-axis and the electric field along the y-axis. 

The allowed modes of this cavity are quantized: $k_l = l\pi/L$, where $l$ is an integer.
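
As a small numerical illustration, the sketch below evaluates the wavevector and the in-cavity wavelength of one mode, assuming the standard standing-wave condition that the wavevector of mode l is lπ/L for a cavity of length L. The function names, the cavity length and the mode index are hypothetical values chosen for illustration, not parameters taken from the text.

```python
import math

def mode_wavevector(l, cavity_length):
    """Wavevector k_l = l*pi/L of the l-th standing-wave mode (rad/m)."""
    return l * math.pi / cavity_length

def mode_wavelength(l, cavity_length):
    """Wavelength inside the cavity, 2*pi/k_l = 2L/l (m)."""
    return 2 * math.pi / mode_wavevector(l, cavity_length)

L_CAVITY = 300e-6   # hypothetical cavity length: 300 micrometres
MODE_INDEX = 2000   # hypothetical mode number

print(mode_wavelength(MODE_INDEX, L_CAVITY))  # about 3.0e-07 m inside the medium
```

Adjacent modes differ in wavevector by π/L, so a longer cavity supports more closely spaced modes.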

The cavity modes can be written down: 





where V is the mode volume. 

Using (5.27) and (5.28), we can express the total resonator fields as: 


$$H_y(\mathbf{r},t) = -\omega_l \sqrt{\frac{2}{V\mu}}\; q_l(t)\cos(k_l z) \tag{7.2}$$


Using (5.33) and (5.34), we can transform the expressions for the magnetic 
and electric fields directly into quantized form: 
(7.3)


The strength of the emission or absorption of photons can be expressed as 
the product of the optical transition rate, the probability that the initial state is
occupied and the probability that the final state is empty. 

The transition rate from state 2 to 1 is given by Fermi’s golden rule, and can 
be derived easily from first-order perturbation theory: 


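$$W_{2\to1} = \frac{2\pi}{\hbar}\left|\mathcal{H}_{2\to1}\right|^2 \delta(E_2 - E_1 - \hbar\omega_l) \tag{7.4}$$

$\mathcal{H}_{2\to1}$ is the interaction Hamiltonian operator for the photonic transition:

$$\mathcal{H}_{2\to1} = -q\,\mathbf{E}_l(\mathbf{r},t)\cdot\mathbf{r} = -q\,E_l(z,t)\,y \tag{7.5}$$

where $q$ is the electronic charge.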


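$$\mathcal{H}_{2\to1} = i q y \sqrt{\frac{\hbar\omega_l}{V\varepsilon}}\left(a_l^{\dagger} - a_l\right)\sin(k_l z) \tag{7.6}$$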


We consider first the transition rate for optical emission. In this case an 
electron in state 2, with n photons in the field, makes a transition to state 1 
accompanied by the emission of a photon so that the final photon state has n + 1 
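photons present. The mode $l$ of the radiation field has frequency $\omega_l$, where
$\hbar\omega_l = E_2 - E_1 = \Delta E$. The transition rate is:

$$W_{2\to1} = \frac{2\pi q^2 \omega_l}{V\varepsilon}\left|\langle 1, n+1|\, y\, a_l^{\dagger}\, |2, n\rangle\right|^2 \sin^2(k_l z)\,\delta(\Delta E - \hbar\omega_l)$$

Using the property of the creation operator, $\langle n+1|\,a^{\dagger}\,|n\rangle = \sqrt{n+1}$,

$$W_{2\to1} = \frac{2\pi q^2 \omega_l}{V\varepsilon}\,(n+1)\left|\langle 1|\,y\,|2\rangle\right|^2 \sin^2(k_l z)\,\delta(\Delta E - \hbar\omega_l) \tag{7.10}$$

The number $|\langle 1|\,y\,|2\rangle|^2 = y_{2\to1}^2$ is the dipole transition matrix element between
$\Psi_2$ and $\Psi_1$. Note that $y_{2\to1} = y_{1\to2}$.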

Assuming that energy is conserved, we can simplify (7.10) in order to 
highlight the most important results: 

$$W_{2\to1} \propto y_{2\to1}^2\,(n+1)\sin^2(k_l z) \tag{7.11}$$

The first term in this expression, proportional to $n$, is the transition rate for
stimulated emission, because it depends on the number of photons already present in the
electromagnetic mode. The stimulated photon is emitted into the same mode. This
means that the emission of one additional quantum is stimulated by those already
present. This stimulated emission acts like a linear amplifier.

However, this is not all. There is an additional component to the photonic
transition rate that is present for all transitions, even when there are no photons
present. This is the second term, and it is appropriately called spontaneous
emission.

Stimulated emission:

$$W_{2\to1} = \frac{2\pi q^2 \omega_l\, y_{2\to1}^2}{V\varepsilon}\; n \sin^2(k_l z)\,\delta(\Delta E - \hbar\omega_l) \tag{7.12}$$

and

Spontaneous emission:

$$W_{2\to1} = \frac{2\pi q^2 \omega_l\, y_{2\to1}^2}{V\varepsilon}\,\sin^2(k_l z)\,\delta(\Delta E - \hbar\omega_l) \tag{7.13}$$

Spontaneous emission always accompanies the transition of an electron from a
higher to lower energy state. The energy of the emitted photon is equal to the
energy difference between these two states, conserving energy. Maxwell’s
equations do not account for spontaneous emission because the photon is created
from the vacuum or empty state. This is to be expected, because Maxwell’s
equations describe the continuous fields resulting from many electrons and
photons. The transition of an electron between two energy levels accompanied
by the creation of a photon can only be described correctly by the quantized
expressions of (7.3).

The fact that spontaneous emission is a quantum event does not mean it is rare.
At room temperature, spontaneous emission of photons dominates stimulated
emission in most circumstances. The LEDs used for general lighting are bright
sources of light beams composed almost entirely of spontaneously emitted
photons.
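
The ratio of the stimulated rate (7.12) to the spontaneous rate (7.13) is simply the photon number n in the mode. To get a feeling for the numbers, the sketch below estimates n under one additional assumption that is not made in the text: that the mode is in thermal equilibrium with its surroundings, so that n follows the Planck (Bose-Einstein) distribution. The function name and the two wavelengths are our own choices for illustration. A driven LED or laser is not in thermal equilibrium, so this is only an order-of-magnitude guide.

```python
import math

H_BAR = 1.054571817e-34  # reduced Planck constant, J*s
K_B = 1.380649e-23       # Boltzmann constant, J/K
C = 2.99792458e8         # speed of light in vacuum, m/s

def thermal_photon_number(wavelength_m, temperature_k):
    """Mean photon number per mode at thermal equilibrium: 1/(exp(hbar*w/kT) - 1)."""
    omega = 2 * math.pi * C / wavelength_m
    return 1.0 / math.expm1(H_BAR * omega / (K_B * temperature_k))

# Ratio of stimulated to spontaneous emission equals n for the mode.
print(thermal_photon_number(600e-9, 300.0))  # visible light: far below 1
print(thermal_photon_number(1e-3, 300.0))    # 1 mm wavelength: well above 1
```

For visible light at room temperature the thermal photon number is vanishingly small, which is consistent with the statement that spontaneous emission dominates unless the mode is deliberately populated, as it is in a laser cavity.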

The transition rate for absorption can be determined in the same way. In this 
case an electron in state 1, with n photons in the field, makes a transition to state 
2 accompanied by the absorption of a photon so that the final photon state has 
n - 1 photons present. The mode $l$ of the radiation field has frequency $\omega_l$, where

$$\hbar\omega_l = E_2 - E_1 = \Delta E$$

$$\Psi_i = |1, n\rangle$$

The transition rate is:

The transition rate for stimulated absorption is equal to the transition rate for
stimulated emission. Note that a “spontaneous absorption” term does not occur
in the analysis. Such an event would violate energy conservation.

We can directly relate the occupation number $n$ to the intensity of the field
as:

and replacing $\sin^2$ by its average value of ½, we can rewrite (7.17) as

making it explicit that the stimulated emission rate is proportional to the
photon population in the lasing mode.

Before proceeding, we should review these key results which create the basis 
for laser operation. 

Laser stands for Light Amplification by Stimulated Emission of Radiation. 

By treating the radiation field as a set of modes $l$, each of which can be
thought of as a harmonic oscillator, we have used the creation and annihilation
operators to describe the interaction of photons and electrons in a cavity of
length L.

The emission of light in this cavity consists of two contributions: 

Spontaneous emission and Stimulated emission 

The transition rate for stimulated emission is proportional to the number of 
photons in the cavity. This means that stimulated emission amplifies the photon 
number in the cavity by moving up the “ladder” of levels of the mode in 
question. This is an important component of laser action. 

If emission occurs in a resonant cavity, for example one with mirrors on each 
end, then emitted photons will be retro-reflected into the cavity, causing more 
photons to be emitted. This positive feedback favors emission at the frequency 
that has the largest dipole transition element. 

Spontaneous emission is always present. Spontaneously emitted photons 
provide the “kindling” that allows stimulated emission to begin. 

The transition rate for stimulated absorption is equal to that for stimulated 
emission. 


7.3 Laser Action 

A laser consists of two components: a photon amplifier and a positive feedback 
circuit. The gain spectrum is the range of optical wavelengths (frequencies) over 
which light emission exceeds absorption. Amplification is generated by
stimulated emission, in which the probability for photon emission is
proportional to the number of photons already present. Positive feedback is
achieved by a pair of reflectors so that light exiting the laser amplifier is returned
to the amplifier medium. In an operational laser, photon amplification
creates optical emission gain. Laser action occurs when the gain exceeds the
photon absorption rate and photon losses in the resonant cavity.

Laser action is an example of the general principle that an amplifier coupled 
with positive feedback generates oscillation where most or all of the energy is 
emitted in a narrow frequency band. Another example is the audio event that 
occurs when a microphone is placed in front of a loudspeaker. The combination 
of amplification plus positive feedback produces a piercing tone, the frequency 
of which occurs at the maximum of the gain spectrum of the amplifier. A third
example (in which you have probably participated) is the rhythmic applause at
the end of a popular concert. Starting as random noise (spontaneous emission), a
frequency emerges, and soon almost everyone in the audience is clapping with
the same rhythm (stimulated emission). The emergence of this frequency
depends on feedback via your ears and the response of your arm muscles, which
amplify the detected signal.

7.3.1 Analysis of Spontaneous and Stimulated Emission 
—A Macroscopic Description 

Analysis of the quantized electro-magnetic field is a powerful method for 
understanding the physics of quantum emission and absorption. It is, however, 
not a helpful method for understanding the design and operation of efficient 
lasers. This requires a macroscopic analysis based on the dynamics of many 
photons and electrons simultaneously (Agrawal and Dutta 1986; Bhattacharya 
1994). 

In Fig. 7.1 we diagram in a very schematic way the three possible transitions
that can take place in the absorption or emission of light. We choose a simple
two-level system having $N_2$ states in the upper energy level occupied by
electrons and $N_1$ states in the lower energy level occupied by electrons. In
equilibrium, $N_2 < N_1$ and the ratio between the two occupation numbers is
given by Boltzmann statistics, that is: $N_2/N_1 = e^{-\hbar\omega/kT}$, where
$\hbar\omega = E_2 - E_1$ is the transition energy. The probability that an electron
can make a transition from the lower energy state to the upper energy state by
stimulated absorption (Fig. 7.1) is equal to $B_{12}$. Because stimulated absorption is the
reverse of stimulated emission, the probability for stimulated emission to occur
(Fig. 7.1) is given by $B_{21}$. The probability for spontaneous emission is different and
we will call this $A_{21}$. We would like to compare the number of absorption
transitions to the number of emission transitions, in order to calculate the gain.

Fig. 7.1 Diagram of the three possible electron-photon interactions. Stimulated emission (a) and stimulated
absorption (b) refer to the fact that the probability for absorption or emission depends on how many other
photons having the energy difference of the transition are already present. In a spontaneous emission
process shown in (c), the probability of emission does not depend on the presence of other photons

The number of spontaneous transitions is given by the number of occupied
states $N_2$ multiplied by the probability of a transition: $N_2 A_{21}$. The number of
stimulated transitions also depends on $\rho(\hbar\omega)$, which is the density of photon states
having an energy equal to the transition energy. The number of transitions by
stimulated emission is: $N_2 B_{21}\,\rho(\hbar\omega)$. The total number of transitions where a
photon is emitted is the sum of these two terms. The number of absorbing
transitions depends on the number of occupied states in the lower level $N_1$ and
the number of photons present having the right energy, $\rho(\hbar\omega)$: $N_1 B_{12}\,\rho(\hbar\omega)$.

Under steady-state conditions, the number of absorbing transitions equals the
number of emitting transitions. We can summarize the discussion so far in a set
of simple equations:

$$\text{Emission rate} = W_{21} = B_{21}\,\rho(\hbar\omega) + A_{21}$$

$$\text{Absorption rate} = W_{12} = B_{12}\,\rho(\hbar\omega)$$

and

$$N_1 B_{12}\,\rho(\hbar\omega) = N_2 B_{21}\,\rho(\hbar\omega) + N_2 A_{21} \qquad (7.20)$$

This allows us to solve for the photon density at the energy of the transition:

$$\rho(\hbar\omega) = \frac{N_2 A_{21}}{N_1 B_{12} - N_2 B_{21}} = \frac{A_{21}/B_{21}}{\dfrac{N_1 B_{12}}{N_2 B_{21}} - 1} \qquad (7.21)$$

Now we will compare this expression for $\rho(\hbar\omega)$ to another one based on the
Planck radiation law. We discussed Planck’s experiments in Chap. 1. The result
of his work was to derive an expression for the energy density of photons. We
recall that Planck discovered that the energy density depends on the temperature
and on the color, or energy of an individual photon. Planck’s radiation law states:

$$\rho(\hbar\omega) = \frac{8\pi n^3 h\nu^3}{c^3}\,\frac{1}{e^{\hbar\omega/kT} - 1} \qquad (7.22)$$

where $h\nu = \hbar\omega$. In comparing (7.21) and (7.22), we can see some similarities.
For example, we know from Boltzmann statistics that: $N_2/N_1 = e^{-\hbar\omega/kT}$.
Therefore, it follows that:

$$\frac{N_1}{N_2} = e^{(E_2 - E_1)/kT} = e^{\hbar\omega/kT}$$

We can see that the two equations are identical when:

$$B_{12} = B_{21} \quad \text{and} \quad \frac{A_{21}}{B_{21}} = \frac{8\pi h}{\lambda^3} = \frac{8\pi n^3 h\nu^3}{c^3} \qquad (7.23)$$

The two expressions in (7.23) are called the Einstein relations, in which c is
the speed of light, n is the index of refraction of the medium involved, and
$\lambda = c/(n\nu)$ is the wavelength in that medium (Casey and Panish 1978).

The ratio of the spontaneous emission rate to the stimulated emission rate is:

$$R = \frac{A_{21}}{\rho(\hbar\omega)\,B_{21}} = e^{\hbar\omega/kT} - 1 \qquad (7.24)$$

This ratio is normally much greater than unity: that is, the spontaneous
emission rate far exceeds the stimulated emission rate. If we consider photons
with an energy of 1 eV ($\lambda$ = 1240 nm), then $\hbar\omega/kT \approx 40$ at room temperature. R, as a
result, is a very big number.
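
To get a feeling for how large this ratio is, the short sketch below evaluates it numerically. It assumes that the ratio for thermal radiation takes the form R = exp(ħω/kT) − 1, which follows from combining the steady-state photon density with Planck's law. The function name and the choice of 300 K for room temperature are illustrative assumptions, not values taken from the text.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def spontaneous_to_stimulated_ratio(photon_energy_ev, temperature_k):
    """Return R = exp(hbar*omega / kT) - 1 for a thermal photon field.

    Assumed form: ratio of spontaneous to stimulated emission rates
    when the photon density follows Planck's radiation law.
    """
    x = photon_energy_ev / (K_B_EV * temperature_k)
    return math.expm1(x)  # expm1(x) = exp(x) - 1


# 1 eV photons at an assumed room temperature of 300 K
exponent = 1.0 / (K_B_EV * 300.0)
ratio = spontaneous_to_stimulated_ratio(1.0, 300.0)
print(f"hbar*omega / kT = {exponent:.1f}")
print(f"R = {ratio:.2e}")
```

With these inputs the exponent comes out close to 40, so in thermal equilibrium spontaneous emission dominates stimulated emission by many orders of magnitude.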

However, in order to have laser action, the reverse must be true: that is, the
stimulated emission rate must be greater than the spontaneous emission rate. To
see how this can happen, read on through the next section.

7.3.2 Optical Gain 

Optical gain and optical absorption are closely related. When light is incident on 
a semiconductor surface, only two things can happen: reflection or transmission. 
Normally, both occur at the same time. Transmitted photons can subsequently be 
absorbed. Absorption can take place if the energy of the incident photons is 
greater than the band gap. Absorption does not occur all in one spot at the 
surface, but rather progressively as the photons propagate into the 
semiconductor. At any point inside the semiconductor, the amount of light that 
gets absorbed is proportional to the total intensity that is present, following the 
principle of stimulated absorption in (7.17). The constant of proportionality is 
called the absorption coefficient. This simple model gives an excellent 
description of reality (Fig. 7.2). 





Fig. 7.2 A simple schematic diagram of light passing through a section of material where absorption is
taking place. Absorption causes the intensity of light to decrease as a function of distance traveled. The
change in the intensity between points $x$ and $x + \Delta x$ is proportional to the intensity at point $x$

We can write down an equation that describes this situation:

$$I(x + \Delta x) - I(x) = \Delta I(x)$$

and

$$\Delta I(x) = -\alpha\,I(x)\,\Delta x$$

In the limit of small $\Delta x$,

$$\frac{dI(x)}{dx} = -\alpha\,I(x); \qquad I(x) = I_0\,e^{-\alpha x}\ \ \text{W cm}^{-2} \qquad (7.25)$$

The value of $\alpha$ depends on the material and on the photon energy. For
example, in the case of GaAs, $\alpha$ is about $10^4$ cm$^{-1}$ for photons having an energy
greater than the bandgap ($E_g$ = 1.43 eV at room temperature). For photons that
are less energetic than the bandgap energy, $\alpha$ is three orders of magnitude
smaller. When the absorption coefficient is a positive number, the intensity of the
beam decreases as the light propagates through the material. However, suppose
that $\alpha$ could be made to be negative. What would be the result?


Example

A beam of monochromatic photons of energy 1.5 eV strikes the surface
of GaAs at normal incidence. What percentage of the original photon beam
penetrates 1 μm beneath the surface? What percentage penetrates 10 μm beneath
the surface?

Solution:

There are only two things that can happen to photons incident on an
interface. Either they are reflected or transmitted. Some of the transmitted
photons are subsequently absorbed. To answer these questions you need to
find first of all the percentage of light that is transmitted, and then find out
what fraction of those photons are absorbed.

The percentage of light reflected is calculated from Fresnel’s equation at normal
incidence, taking n = 3.4 for GaAs:

$$R = \frac{(n - 1)^2}{(n + 1)^2} = \frac{(2.4)^2}{(4.4)^2} \approx 0.30$$

$$T = 1 - R \approx 0.70$$

So, about 70% of the light is transmitted.
To calculate the intensity:

$$I = I_0\,e^{-\alpha x} = I_0\,e^{-(10^4\ \text{cm}^{-1})\,x}$$

For 1 μm penetration, $I = I_0\,e^{-1}$, so that

$$I = I_0\,(0.37) = 0.70 \times 0.37 = 0.26$$

For 10 μm penetration,

$$I = 0.70 \times 4.5 \times 10^{-5} = 3.2 \times 10^{-5}$$
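
The arithmetic of this example can be checked with a few lines of code. The sketch below combines the normal-incidence Fresnel reflectance with exponential absorption. The refractive index n = 3.4 is an assumption read off from the (n + 1) = 4.4 factor in the Fresnel expression, and the function names are illustrative. Small differences from hand-rounded values are to be expected.

```python
import math


def fresnel_reflectance_normal(n):
    """Reflectance at normal incidence from air into a medium of index n."""
    return ((n - 1.0) / (n + 1.0)) ** 2


def surviving_fraction(n, alpha_per_cm, depth_um):
    """Fraction of the incident beam that reaches a given depth.

    Transmission through the surface (1 - R) multiplied by
    exponential absorption exp(-alpha * x) inside the material.
    """
    depth_cm = depth_um * 1.0e-4  # 1 um = 1e-4 cm
    transmitted = 1.0 - fresnel_reflectance_normal(n)
    return transmitted * math.exp(-alpha_per_cm * depth_cm)


N_GAAS = 3.4          # assumed refractive index of GaAs
ALPHA_GAAS = 1.0e4    # absorption coefficient above the bandgap, cm^-1

for depth_um in (1.0, 10.0):
    fraction = surviving_fraction(N_GAAS, ALPHA_GAAS, depth_um)
    print(f"depth = {depth_um:4.1f} um: fraction = {fraction:.3g}")
```

Because the absorption length 1/α is only 1 μm, almost all of the transmitted light is absorbed within the first few micrometers below the surface.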

To continue our discussion of absorption, consider what happens to the
number of photons $\mathcal{N}$ per unit volume, or the photon density, as a function of
time. The photon density will decrease as the number of electron transitions
from level 1 to level 2 increases. The density will increase when the number of
transitions from state 2 to state 1 increases. Using $B_{12} = B_{21}$:

$$\frac{d\mathcal{N}}{dt} = -N_1 B_{12}\,\rho(\hbar\omega) + N_2 B_{21}\,\rho(\hbar\omega) = -(N_1 - N_2)\,B_{21}\,\rho(\hbar\omega) \qquad (7.26)$$

The photon density is closely related to the energy density: $\rho(\hbar\omega) = \mathcal{N} \cdot \hbar\omega$.

Similarly, the intensity is related to the energy density: $I = \rho(\hbar\omega)\,\frac{c}{n}$.

In (7.25) we derived a relationship between intensity and distance. Because
of the relationship between the intensity and the photon density, we can write
another expression for the gradient:

$$\frac{dI(x)}{dx} = \hbar\omega\,\frac{c}{n}\,\frac{d\mathcal{N}}{dx} = \hbar\omega\,\frac{c}{n}\,\frac{d\mathcal{N}}{dt}\,\frac{dt}{dx} \qquad (7.27)$$

For the case of light: $\frac{dx}{dt} = \frac{c}{n}$. Since this is a simple constant, the inverse
expression that we would like to substitute in (7.27) is the arithmetic inverse.
That is: $\frac{dt}{dx} = \frac{n}{c}$.

Using these results we can determine the condition for generating optical
gain.

$$\frac{d\mathcal{N}}{dt} = \frac{1}{\hbar\omega}\,\frac{dI(x)}{dx} = -\frac{\alpha}{\hbar\omega}\,I(x) = -\frac{\alpha}{\hbar\omega}\,\rho(\hbar\omega)\,\frac{c}{n}$$

Using (7.26),

$$\alpha = (N_1 - N_2)\,B_{21}\,\frac{n\hbar\omega}{c} \qquad (7.28)$$

So $\alpha$ is positive, and absorption occurs when $N_1 > N_2$. On the other hand, $\alpha$
is negative and amplification occurs when $N_2 > N_1$. This simple condition is
called population inversion. You may notice that although simple, it appears to
violate the requirements of Boltzmann statistics. Thus population inversion is
not an equilibrium condition, but it can be obtained in steady state (Pearsall
2008). The art of making a laser is understanding how this condition can be
achieved in real materials (Fig. 7.3).

Fig. 7.3 Photon absorption, transparency and emission. The presence of photons in a medium (gaseous,
liquid or solid) stimulates transitions of electrons between energy levels. These transitions are accompanied
by either photon emission or absorption in order to conserve energy. The net result depends on the
difference between the number of occupied states in the two levels. When $N_2 < N_1$, net absorption
occurs. When $N_2 > N_1$, net emission occurs. This leads to optical amplification and laser action. The
medium is said to be transparent when $N_2 = N_1$


When $\alpha$ is negative, the result is gain rather than loss, and instead of using $\alpha$
to describe this condition, we should define a gain coefficient $k$, where:

$$\text{Gain Coefficient} = k = -\alpha \qquad (7.29)$$

7.3.3 $N_2 > N_1$: Population Inversion

So far we have considered light emission from a system of electrons having two
energy levels, $E_1$ and $E_2$. Looking at Fig. 7.1 again, you can see that there is
one way for electrons to get pumped into the upper level, by stimulated
absorption. We have demonstrated that this rate is equal to the stimulated
emission rate. However, there is a second way for electrons to be de-excited from
the upper level, by spontaneous emission. We have shown that this rate is much
bigger than the stimulated absorption rate. The result is that there is no way to
obtain a population inversion in a 2-level system.



































Of course, you might be able to get an inversion if electrons were somehow
fed into the upper level by another source: a third level. This turns out to be the
road to the solution. In general, lasing is easiest to obtain in a four-level
arrangement that is diagrammed schematically in Fig. 7.4.

Fig. 7.4 Population inversion can be obtained in a 4-level system, in this case between level 4 and level 3.
The excitation and recombination cycle is given in sequence by the numbers in the figure. There are four
steps to the complete cycle

In the beginning of the cycle, all the electrons are in the ground state. If level
$E_3$ is a few $k_B T$ above level $E_1$, it will be nearly empty by Boltzmann statistics.
The cycle starts when a high energy photon with energy $\hbar\omega = E_2 - E_1$ excites an
electron from the ground state to the excited state (step 1). The photon is a
particle, so all its energy must be absorbed, making a direct transition to $E_3$ or
$E_4$ impossible. After excitation the electron can be scattered into state $E_4$ during a
collision (step 2). Electrons are more likely to end up in state $E_4$ than state $E_3$ or
state $E_1$ because the energy difference is smaller, and so easier to make up by
phonon emission. After step 2 there are electrons in state $E_4$ but not in state $E_3$.
Thus, a population inversion between these two levels now exists. The
recombination that follows is an example of optical gain, since emission between
these levels far exceeds absorption, which is practically 0 (step 3). This transition
can be a lasing transition if suitable feedback is provided, which increases the
rate of stimulated emission. Finally, electrons that reach level $E_3$ are recycled to
level $E_1$, leaving state $E_3$ empty again (step 4). In this example, the number of
photons absorbed is still equal to the number of photons emitted. However, there
is now one set of levels that does most of the absorption, and another set that
generates most of the emission.








A semiconductor laser is a good example of a 4-level system, and this can be 
understood quickly from a simple band structure diagram such as that in Fig. 7.5. 
Optical stimulation of lasing is relatively easy to demonstrate in a direct gap 
material, and it proceeds following the cycle outlined above. However, the cycle 
for obtaining gain by electrical excitation is quite different. In this case the 
behavior of the p-n junction is used to create a population inversion. 



Fig. 7.5 A direct bandgap semiconductor is a good example of a 4-level medium for laser action. There is a
significant difference in the pumping scheme, because level $E_3$ is initially fully occupied by electrons.
However, as soon as a photon is absorbed, electrons leave this level to fill the hole in the valence band that
is created by the excitation

The pumping cycle in Fig. 7.5 is different from the cycle in Fig. 7.4. Initially,
level $E_3$ is fully occupied by electrons. Optical excitation proceeds by the
absorption of a photon (1). In order to conserve energy and momentum, the
electron that is excited to the conduction band must originate deep in the valence
band as shown. Then nearly simultaneously, the excited electron in the
conduction band relaxes to state $E_4$ and the electron in state $E_3$ relaxes to state
$E_1$, leaving a hole behind (2). Relaxation takes place by emission of phonons,
and is completed in $10^{-12}$ s. Now there is an electron in state $E_4$ and a hole in
state $E_3$, creating a population inversion. This situation can persist for about
$10^{-9}$ s. That is three orders of magnitude longer than the relaxation process.
Finally, recombination occurs across the gap (3). This transition can be used to
make a laser, if suitable feedback is provided.

In a semiconductor material, both spontaneous and stimulated emission
proceed by this “4-step” process. No matter how far the energy of the optical
excitation lies above the bandgap, the energy of the photons emitted during
recombination is always close to the bandgap energy. Optical gain only occurs in
this range of energies, which we can denote by a distribution function, $g(\nu)$, called
the optical gain spectrum, $\nu$ being the photon frequency (Fig. 7.6). This is
always the situation because the lifetime for optical recombination is orders of
magnitude longer than the time for phonon emission. The lower energy portion
of the gain curve rises in proportion to the joint density of states. The high
energy cutoff is due to self absorption of photon emission. Typical values for the
energy width of the gain spectrum lie in the range of 0.01-0.02 eV. In order to
take account of this feature we will modify (7.29) for the gain coefficient:



Fig. 7.6 Gain spectrum. The gain spectrum measures the difference between emission and absorption as a
function of frequency or energy. The rise in the gain at lower energies is due to the increasing density of
states while the fall-off at higher energies is the result of the onset of self-absorption of the emitted photons.
The half width of the gain spectrum is approximately equal to kT. The figure shows a comparison between
the measured (dots) and calculated (solid line) gain spectra for GaInAsP, a laser medium which is widely
used to make semiconductor diode lasers for optical fiber telecommunications (from Goebel 1982,
reproduced by permission)


$$k = (N_2 - N_1)\,B_{21}\,\frac{n\hbar\omega\,g(\nu)}{c} \qquad (7.30)$$


7.4 Semiconductor Diode Lasers 

Up to this point, we have considered principles of laser action. We will continue 
the development in the specific context of semiconductor diode lasers. The 
semiconductor laser is a prime example of a quantum photonic device. 

Today, semiconductor lasers represent 80% of the laser market value, and the 
percentage is even higher if we consider the number of lasers sold each year. 
These percentages are on the increase. Other types of lasers, such as fiber lasers 
or disk lasers, which are widely used for welding and machining of steel, are 
pumped optically by diode lasers. There is a trend to replace some of these laser 
systems by direct diode laser processing. 

The fabrication of semiconductor lasers integrates material properties, 
photon resonant cavity design and the design of electron-photon recombination. 

7.4.1 Population Inversion in Semiconductor Materials 

The most important materials requirement concerns the relative lifetimes of the
various steps shown in Fig. 7.5. In order to build up a population inversion in
level 4 compared with level 3, the lifetime of step 2 should be shorter than the
lifetime of step 3. This ensures that the filling of level 4 occurs at a faster rate
than it empties by spontaneous emission. Similarly, the lifetime of step 4 should
also be shorter than the lifetime of step 3. This ensures that level 3 will have
enough empty states (holes) to accommodate the electrons that transition from
level 4.

$$\tau_2 < \tau_3, \qquad \tau_4 < \tau_3 \qquad (7.31)$$


Steps 2 and 4 are accomplished by interband scattering of electrons by
optical phonons. The lifetimes are similar and in the range of $\tau_2 \approx \tau_4 \approx 10^{-12}$ s.
The lifetime for optical emission should be longer, but not so long that electrons
lose their energy by non-radiative recombination. This requirement is satisfied
by direct bandgap semiconductors, like GaAs. The lifetime for radiative
recombination across the direct bandgap is $\approx 10^{-9}$ s. This condition is not
satisfied by indirect bandgap semiconductors like Si, which explains why laser
action by emission across the bandgap has not been measured in this material.

Carrier injection in a p-n junction is a convenient and efficient way to
introduce excess electrons in the conduction band. This allows us to implement
step 1 by electrical pumping. We show in Fig. 7.7 the energy level diagram for a
p-n junction, heavily doped so that the Fermi level actually lies in the conduction
band on the n-side and in the valence band on the p-side.

Fig. 7.7 Energy level diagram for a heavily-doped p-n homojunction. a 0-bias. Fermi level lies in the
valence band (VB) on the p-side and the conduction band (CB) on the n-side. b Moderate forward bias
($V_{\text{bias}} < V_{\text{built-in}}$): The depletion width $\Delta w$ narrows, but the electron and hole populations do not
overlap, and photonic recombination is negligible. c Large forward bias ($V_{\text{bias}} > V_{\text{built-in}}$): The depletion
width reaches a minimum value. For GaAs, this is about 300 nm. The electron and hole populations overlap
in space, and photonic recombination takes place


Figure 7.7a, b and c show the p-n junction in states of increasing forward bias
voltage. The physical width of the charge depletion region narrows as the bias
voltage increases. When the bias voltage exceeds the built-in voltage, it creates
1-dimensional carrier confinement for both electrons and holes as shown in
Fig. 7.7c. In this regime, a state of population inversion can be attained,
supporting laser action. Robert Hall of General Electric Laboratories made the
first semiconductor p-n diode laser in 1962 using GaAs. While this
demonstration validated the physical principle, more invention and engineering
was necessary to build a semiconductor laser that can function efficiently at room
temperature. The key to the success of semiconductor lasers over all other
varieties is that it is possible by materials engineering to modify important
parameters that directly impact laser device performance. One such key
parameter, shown in Fig. 7.7, is $\Delta w$, which is the dimension of the region where
population inversion occurs.

To summarize, a laser is an amplifier with positive feedback. We have 
determined that the condition for amplification to occur is a population 
inversion, and we have described how this can be achieved in a p-n junction. The 
other half of the requirement is to create optical feedback. 

7.4.2 Optical Feedback —Making a Laser 

A simple optical resonator can be formed by 2 parallel mirrors. In a
semiconductor like GaAs, the mirrors can be obtained by cleaving the GaAs
material along crystal planes, which yields atomically-flat, parallel surfaces. The
gain coefficient of direct band-gap semiconductors like GaAs is very large
compared to that of gas lasers, like He-Ne, or solid state laser materials like
Nd-YAG. As a result the mirrors at each end of the cavity do not need to be as
efficient as those required for other kinds of laser materials. There are two big
performance benefits: one is that more power can be extracted from a
semiconductor laser at modest input power levels, and the second is that there is
a much larger tolerance in the design of the resonator in order to make a working
device. There is also a big saving in space. This is why you can hold a
semiconductor laser on the tip of your finger, while you need a table top to hold a
gas laser or a YAG laser.



A forward bias voltage applied to the diode creates excess concentrations of 
electron-hole pairs. Electron-hole recombination generates photons that depart in 
all directions by spontaneous emission. Some of these photons will travel along 
the line that is perpendicular to the reflecting surface of the two parallel mirrors. 
These photons will be reflected and will travel back into the diode along the 
same path. Of course, there will be some loss in this process. Some photons will 
be re-absorbed along the way. Others will be scattered out of alignment by 
defects in the optical path. Most important of all, some will traverse the mirror 
and be emitted into free space. This is the useful output of the device. Reflected 
photons will stimulate the emission of other photons. These stimulated photons 
contribute to the electromagnetic field creating gain. In the development leading 
to (7.17), we have shown that these photons must have the same energy and the 
same phase as the stimulating photon. If it were otherwise these photons would 
interfere destructively with the electromagnetic field. In order for lasing to occur, 
the gain initiated by a photon during one round trip circuit must be greater than 
the losses incurred during the same circuit including the fraction of the intensity 
that is emitted through the reflecting mirrors . Lasing threshold is defined as the 
point when the gain is equal to the loss. 

We can treat any loss along the optical path by an effective absorption
coefficient which we will denote by $\gamma$. The gain coefficient $k$ behaves just like a
loss with the opposite sign. The net laser gain can be expressed:

$$\text{Gain} = R_1 R_2\,e^{2(k - \gamma)L} \qquad (7.32)$$

$R_1$ and $R_2$ are the reflectivities of the mirrors at either end of the gain region,
and $L$ is the cavity length. In the case of a semiconductor laser $R_1$ and $R_2$ are
usually the same. During laser operation, the only variable in this expression is
the gain coefficient which depends on the pumping rate, that is, on the forward
current in the diode. Everything else remains constant.
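
The net gain expression lends itself to a quick numerical check. The sketch below assumes that (7.32) has the common round-trip form Gain = R1·R2·exp(2(k − γ)L), and solves Gain = 1 for the gain coefficient at threshold. The loss coefficient, cavity length and facet reflectivity used here are hypothetical values chosen only for illustration.

```python
import math


def round_trip_gain(k_per_cm, gamma_per_cm, length_cm, r1, r2):
    """Net gain for one round trip (assumed form of the net gain expression)."""
    return r1 * r2 * math.exp(2.0 * (k_per_cm - gamma_per_cm) * length_cm)


def threshold_gain(gamma_per_cm, length_cm, r1, r2):
    """Gain coefficient for which the round-trip gain equals unity."""
    return gamma_per_cm + math.log(1.0 / (r1 * r2)) / (2.0 * length_cm)


# Hypothetical cleaved-facet GaAs laser
GAMMA = 10.0     # internal loss, cm^-1 (assumed)
LENGTH = 0.03    # cavity length, cm (300 um, assumed)
MIRROR_R = 0.30  # facet reflectivity (assumed, same at both ends)

k_th = threshold_gain(GAMMA, LENGTH, MIRROR_R, MIRROR_R)
print(f"threshold gain coefficient = {k_th:.1f} cm^-1")
print(f"round-trip gain at threshold = "
      f"{round_trip_gain(k_th, GAMMA, LENGTH, MIRROR_R, MIRROR_R):.3f}")
```

With these numbers the mirror loss is several times larger than the internal loss, which illustrates why a short cavity with modest reflectivity needs a large gain coefficient.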

The laser threshold is reached when the net gain is unity. This also defines
the threshold gain:

$$R_1 R_2\,e^{2(k_{th} - \gamma)L} = 1, \qquad k_{th} = \gamma + \frac{1}{2L}\ln\!\left(\frac{1}{R_1 R_2}\right)$$

In the case where $R_1$ and $R_2$ are the same,

$$k_{th} = \gamma + \frac{1}{L}\ln\!\left(\frac{1}{R}\right) \qquad (7.33)$$

Although spontaneous emission diverts light from lasing modes, reducing the
laser efficiency, its presence is absolutely required to make the laser work in the
first place. The spontaneous emission “primes the pump” in the beginning by
filling all possible radiation modes with photons. Gain and laser action then
build up out of the noise in the much smaller number of modes that overlap in
energy with the gain spectrum and which are resonant modes of the reflecting
cavity (Fig. 7.8).

In general there are a number of modes that lie in the gain spectrum. The
exact number can always be calculated from the width of the gain spectrum, the
cavity length and the wavelength. Because the gain of semiconductor materials
is so large, lasing action often occurs at several modes simultaneously.
Single-mode operation can be achieved by making the cavity wavelength selective.
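
As a rough illustration of this count, the sketch below estimates how many longitudinal modes fall inside the gain spectrum. It assumes the standard standing-wave mode spacing Δν = c/(2nL) and, for simplicity, uses the refractive index rather than the group index. The index, cavity length and gain width are hypothetical values, not taken from the text.

```python
C_M_PER_S = 299_792_458.0   # speed of light in vacuum, m/s
H_EV_S = 4.135667696e-15    # Planck constant, eV s


def mode_spacing_hz(n, cavity_length_m):
    """Frequency spacing between adjacent standing-wave cavity modes."""
    return C_M_PER_S / (2.0 * n * cavity_length_m)


def modes_in_gain_spectrum(n, cavity_length_m, gain_width_ev):
    """Whole number of mode spacings that fit within the gain width."""
    gain_width_hz = gain_width_ev / H_EV_S
    return int(gain_width_hz // mode_spacing_hz(n, cavity_length_m))


# Hypothetical values: n = 3.5, L = 300 um, gain width 0.02 eV
spacing = mode_spacing_hz(3.5, 300e-6)
count = modes_in_gain_spectrum(3.5, 300e-6, 0.02)
print(f"mode spacing = {spacing:.3e} Hz")
print(f"modes within the gain spectrum = {count}")
```

Since the spacing scales as 1/L, shortening the cavity reduces the number of modes under the gain curve, which is one route toward single-mode operation.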

At this point we have assembled all the elements of a laser: gain, population
inversion, and a resonant cavity for feedback. It remains only to determine the
level of current injection into the diode that is required in order to achieve laser
action.


7.4.3 Laser Threshold 

In a p-n junction laser, population inversion is achieved by electrical pumping . 
The threshold current is attained when the number of electrons per second being 
injected into the diode is equal to the threshold population density, taking into 
account that some of the electrons will be lost to recombination before a suitable 
population inversion is built up. 

The number of electrons injected per second per square cm into the diode is
just the current density divided by the electronic charge: $J/q$. If we consider the
rate of electrons per second in the recombination region we need to divide this
expression by the thickness $t$ (= $\Delta w$ in Fig. 7.7) of the recombination region.

$$\text{Pumping Rate} = \frac{J}{qt} \quad (\text{electrons s}^{-1}\ \text{cm}^{-3}) \qquad (7.34)$$


What goes in must come out, and so the recombination rate must equal the
pumping rate. This is the optoelectronic equivalent of the principle that
absorption be equal to emission. The recombination rate is the population
inversion necessary to achieve threshold divided by the recombination lifetime:

$$\text{Recombination rate} = \frac{(N_2 - N_1)_{th}}{\tau_r} \quad (\text{electrons s}^{-1}\ \text{cm}^{-3}) \qquad (7.35)$$

where $\tau_r$ is the minority carrier lifetime.


We have already developed an expression for the population inversion in
(7.28). Including the gain spectrum as in (7.30), at threshold:

$$(N_2 - N_1)_{th} = \frac{c\,k_{th}}{B_{21}\,n\hbar\omega\,g(\nu)} \qquad (7.36)$$

where $B_{21}$ is the stimulated emission coefficient. In (7.23) we related $B_{21}$ to
the spontaneous recombination rate. This is a useful relationship to know
because you can measure this rate directly.

$$B_{21} = \frac{c^3}{8\pi n^3 h\nu^3}\,A_{21} \qquad (7.37)$$

Substituting (7.37) into (7.36), with $\hbar\omega = h\nu$,

$$(N_2 - N_1)_{th} = \frac{8\pi n^2 \nu^2\,k_{th}}{c^2\,A_{21}\,g(\nu)} \qquad (7.38)$$

The spontaneous emission rate $A_{21}$ is determined by the inverse of the
spontaneous emission lifetime, $A_{21} = 1/\tau_{21}$. This lifetime can be measured by exciting the
laser material with a light pulse from an external laser emitting photons with
energy above the band gap of the semiconductor. The semiconductor will emit
photoluminescence that dies out with the spontaneous emission lifetime. Typical
values for $\tau_{21}$ are several nanoseconds.

The threshold current density can be expressed by combining (7.34), (7.35),
and (7.38),

$$J_{th} = \frac{8\pi n^2 \nu^2\,q\,t}{c^2\,g(\nu)}\,\frac{\tau_{21}}{\tau_r}\left[\gamma + \frac{1}{L}\ln\!\left(\frac{1}{R}\right)\right] \qquad (7.39)$$

You would like to have a lower threshold current. There are some variables
in this expression that are under the control of the laser designer. The thickness
of the recombination region can be reduced physically. This was first done by
making a heterostructure, and has been developed into the currently-used
quantum well design, where the recombination is restricted to a potential well of
thickness comparable to the de Broglie wavelength; that is, about 10 nm.

In a semiconductor laser having a bandstructure similar to that shown in
Fig. 7.5, the gain function can be adequately represented by a Gaussian
distribution. The value of the distribution at its maximum value can be
expressed in terms of its full-width at half-maximum (usually abbreviated
FWHM), $\Delta\nu$:

$$g(\nu_{max}) = \frac{2}{\Delta\nu}\sqrt{\frac{\ln 2}{\pi}} \approx \frac{0.94}{\Delta\nu} \qquad (7.40)$$

The exact form of the gain distribution function is almost never known. It
can be adequately approximated by:

$$g(\nu_{max}) \approx \frac{1}{\Delta\nu} \qquad (7.41)$$

The fundamental nature of stimulated emission dictates that the laser will
want to emit light whose frequency lies as close as possible to the peak of the
gain distribution. We can use this approximation to make a practical estimate of
the threshold current density.

$$J_{th} \approx \frac{8\pi n^2 \nu^2\,q\,t\,\Delta\nu}{c^2}\,\frac{\tau_{21}}{\tau_r}\left[\gamma + \frac{1}{L}\ln\!\left(\frac{1}{R}\right)\right] \qquad (7.42)$$

All the variables in this expression are easily accessible. The ratio $\tau_r/\tau_{21}$ is
referred to as the internal quantum efficiency, and is typically taken to be unity.
The width of the recombination region, $t$, is controlled during fabrication. In a
heterostructure laser this is typically about $10^{-5}$ cm, and in a quantum well laser,
about one order of magnitude less, ~$10^{-6}$ cm. The width of the gain spectrum in
energy is about $kT$, or 0.02 eV at room temperature.
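
To see how these quantities combine, here is a minimal numerical sketch. It assumes that the practical estimate takes the widely used textbook form J_th ≈ (8π n² ν² q t Δν / c²)(1/η)(γ + (1/L) ln(1/R)), with η the internal quantum efficiency. Apart from the two thicknesses and the 0.02 eV gain width quoted above, every parameter value below (index, wavelength, loss, cavity length, reflectivity) is a hypothetical choice for illustration, and the result should be read as an order-of-magnitude exercise rather than a device prediction.

```python
import math

C_CM_PER_S = 2.99792458e10   # speed of light, cm/s
Q_COULOMB = 1.602176634e-19  # electronic charge, C
H_EV_S = 4.135667696e-15     # Planck constant, eV s


def threshold_current_density(n, wavelength_cm, gain_width_ev, t_cm,
                              gamma_per_cm, length_cm, mirror_r,
                              internal_efficiency=1.0):
    """Estimate J_th in A/cm^2 from the assumed textbook expression."""
    nu = C_CM_PER_S / wavelength_cm          # optical frequency, Hz
    delta_nu = gain_width_ev / H_EV_S        # gain width, Hz
    # Threshold gain for identical mirrors of reflectivity mirror_r
    k_th = gamma_per_cm + math.log(1.0 / mirror_r) / length_cm
    return (8.0 * math.pi * n**2 * nu**2 * Q_COULOMB * t_cm * delta_nu * k_th
            / (C_CM_PER_S**2 * internal_efficiency))


# Hypothetical GaAs-like parameters
common = dict(n=3.5, wavelength_cm=0.85e-4, gain_width_ev=0.02,
              gamma_per_cm=10.0, length_cm=0.03, mirror_r=0.30)

j_hetero = threshold_current_density(t_cm=1.0e-5, **common)   # heterostructure
j_qwell = threshold_current_density(t_cm=1.0e-6, **common)    # quantum well
print(f"heterostructure: J_th ~ {j_hetero:.1f} A/cm^2")
print(f"quantum well:    J_th ~ {j_qwell:.2f} A/cm^2")
```

The estimate is linear in the thickness t, so reducing the recombination region by a factor of ten lowers the estimated threshold current density by the same factor.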


7.5 Laser Engineering 
7.5.1 Heterostructure Lasers 

The p-n junction structure of Fig. 7.7 can be improved to make a better laser in
two important ways. One is to reduce the thickness of the recombination region
($t$ in (7.42)), and the other is to impose a structure that confines both the emitted
light and the electron-hole pairs to the same region in space. Confinement is the
key concept: in stimulated emission, the presence of a photon stimulates emission
of a second photon. The photonic field must overlap with the population inversion of
electrons. A semiconductor heterostructure can be used to confine
simultaneously both photons and electron-hole pairs to the same region of space.
The simplest realization of such a semiconductor heterostructure was designed
and demonstrated by I. Hayashi, M.B. Panish and P. Foy. It enabled
room-temperature CW operation of a semiconductor laser in 1969 for the first time,
making the semiconductor laser a practical reality, 7 years after first being
demonstrated by R.N. Hall. Zhores Alferov and Herbert Kroemer shared part of
the 2000 Nobel Prize in Physics for conceiving the use of a semiconductor
heterostructure to transform the GaAs laser diode from a laboratory experiment
to a useful device (Fig. 7.9).

Fig. 7.8 The resonant wavelengths of a laser cavity are those select wavelengths for which a standing wave
can be established between the two mirrors at either end of the cavity. These cavity modes are equally
spaced in frequency (b). Only the few that happen to occur in the gain spectrum (a) of the semiconductor
material will participate in the necessary positive photon feedback (c)


7.5.2 Optical Confinement—Waveguiding 

We can use Snell’s law for refraction to determine the critical angle for total 
internal reflection in a heterostructure of Al$_x$Ga$_{1-x}$As/GaAs. This effect creates 
optical waveguiding in the plane perpendicular to the direction of propagation 
that is defined by the resonator mirrors. As diagrammed schematically in 
Fig. 7.10, a ray incident on the interface between two materials at an angle $\theta_1$ 
relative to a line perpendicular to the interface is refracted at an angle $\theta_2$ in the 
second material. If the index of refraction in region 1 is greater than that in 
region 2, then it follows from Snell’s law that $\theta_2 > \theta_1$. The angle of incidence for which 
$\theta_2 = \pi/2$ radians is defined as the critical angle, $\theta_c$. The critical angle for total 
internal reflection for the Al$_{0.30}$Ga$_{0.70}$As/GaAs waveguide is: 

$$\theta_c = \arcsin\left(\frac{n_2}{n_1}\right) \approx 71^\circ$$


Fig. 7.9 Energy-level diagram for an Al$_x$Ga$_{1-x}$As-GaAs heterostructure diode laser. a The energy band 
edges of the p-n junction in strong forward bias. Electrons are trapped in the potential well of the 
conduction band and holes are trapped in the potential well of the valence band, achieving significant spatial 
overlap of the population inversion. The width of the recombination region is fixed physically by the 
materials growth process, and can now be significantly narrower than the 300 nm lower limit of the 
homostructure diode. Note the position of the quasi Fermi level for electrons in the conduction band ($E_{FC}$) 
and the quasi Fermi level for holes in the valence band ($E_{FV}$). b The index of refraction of GaAs is 
higher than the index of refraction of Al$_x$Ga$_{1-x}$As. Thus the GaAs recombination region acts like the core 
of an optical waveguide. Much of the recombination radiation will be confined by total internal reflection, 
so that the density of photons in the GaAs recombination region is greatly increased. This increases 
stimulated emission of radiation (reproduced from Amnon Yariv, Optical Electronics in Modern 
Communications, 1996, by permission of Oxford University Press, USA) 

Spontaneous emission that lies within the “escape cone” of 142° is radiated 
out of the heterostructure. The direction of laser emission is defined by the axis 
of the resonators. This emission is strongly guided by the heterostructure, which serves to 
reinforce the ratio of stimulated to spontaneous emission in the output beam 
(Coldren et al. 2012). 
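
The relation between the critical angle and the 142° escape cone can be checked with a few lines of Python. This is a minimal sketch: the two refractive indices are assumed illustrative values for GaAs and Al$_{0.30}$Ga$_{0.70}$As near the emission wavelength (they are not quoted in the text), and the escape cone is taken to be twice the critical angle.

```python
import math

def critical_angle_deg(n_core: float, n_clad: float) -> float:
    """Critical angle, in degrees from the interface normal, from Snell's law."""
    if n_clad >= n_core:
        raise ValueError("total internal reflection requires n_core > n_clad")
    return math.degrees(math.asin(n_clad / n_core))

# Assumed illustrative indices (hypothetical inputs, not taken from the text)
N_GAAS = 3.59     # GaAs recombination region
N_ALGAAS = 3.39   # Al0.30Ga0.70As confinement layer

theta_c = critical_angle_deg(N_GAAS, N_ALGAAS)
escape_cone = 2.0 * theta_c   # full angle of the cone centred on the normal
print(f"critical angle = {theta_c:.1f} deg, escape cone = {escape_cone:.0f} deg")
```

With these assumed indices, rays striking the interface at more than about 71° from the normal are trapped and guided, while rays inside the cone leave the structure.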

Fig. 7.10 Spontaneous emission occurs in all directions with equal probability. Given the indices of 
refraction of GaAs and Al$_{0.30}$Ga$_{0.70}$As, the “escape cone” according to Snell’s law is 142°. A photon 
emitted within this cone will leave the laser structure. Otherwise, the photons are trapped by total internal 
reflection and are waveguided by the laser structure. Resonators at the ends of the laser cavity determine the 
modes that are highly reflected, and which will have the highest photon density leading to the strongest 
amplification by stimulated emission 

The simple heterostructure concept shown in Fig. 7.9 can be further 
improved. First of all, confinement of the optical mode as well as confinement of 
the electron-hole population can be achieved not just in one dimension, as shown 
in Fig. 7.8, but in two dimensions perpendicular to the direction of propagation. 
This is implemented by the buried heterostructure laser. Second, the confinement 
dimension of electrons and holes can be optimized independently of the 
confinement of the optical mode. This is called the separate-confinement 
heterostructure, or SCH. This concept leads naturally to the quantum well laser. 
We will treat both of these structures. 

7.5.3 Buried Heterostructure Lasers 

The buried heterostructure laser is an extension of the simple heterostructure 
laser. In this design the active region is surrounded on all sides by higher 
bandgap and lower index AlGaAs (Fig. 7.11). 

In the usual buried heterostructure design, the vertical dimension of the 
active region is about 200 nm, and is defined during epitaxial growth. The lateral 
dimension of the active region is typically 15 times larger and is defined by 
optical lithography (Fig. 7.12). 

The optical mode intensity is shared between the active region, where it 
contributes to stimulated emission, and the confinement regions, where it extends as an 
evanescent wave (Yariv 1996). Since AlGaAs has a larger bandgap than 
GaAs, laser radiation can stimulate additional recombination only in the GaAs 
active region. As the thickness of the active region is decreased, the intensity of 
the photonic radiation in the recombination region also decreases. The sharing of 
the optical mode volume between the active region and the waveguiding region 
is characterized by the confinement factor $\Gamma$. The confinement factor is easily 
calculated from Maxwell’s equations, given the index of refraction profile 
(Casey and Panish 1978). The result obtained for the Al$_{0.3}$Ga$_{0.7}$As/GaAs laser is 
shown in Fig. 7.13. 


Fig. 7.11 Cross section of a buried heterostructure laser. Waveguide confinement and feedback assure that 
laser emission occurs in the direction perpendicular to the plane of the cross-section 


The confinement factor modifies the expression for the threshold current 
density: 

(7.43) 


The confinement factor approaches unity when the active region thickness is 
greater than the wavelength of the laser emission in the GaAs medium: 

$$t \gg 250\ \text{nm}.$$

As the active layer thickness decreases below this limit, the confinement 
factor decreases exponentially. Thus the ratio $t/\Gamma$ increases, resulting in a dramatic 
rise in the threshold current density due to the reduction of the photon density in 
the electron-hole recombination region (Fig. 7.14). 
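
The competition between a thinner active region and a falling confinement factor can be illustrated numerically. The sketch below does not solve Maxwell's equations. It assumes a commonly used closed-form approximation for the fundamental mode of a symmetric slab waveguide, $\Gamma \approx D^2/(2 + D^2)$ with $D = (2\pi/\lambda)\,t\,\sqrt{n_1^2 - n_2^2}$, together with assumed index values and an 850 nm emission wavelength. The approximation falls off quadratically rather than exponentially at small $t$, so it captures the trend only.

```python
import math

WAVELENGTH_NM = 850.0   # free-space emission wavelength (assumed)
N_CORE = 3.59           # GaAs active region (assumed value)
N_CLAD = 3.39           # Al0.30Ga0.70As cladding (assumed value)

def confinement_factor(t_nm: float) -> float:
    """Approximate confinement factor of the fundamental slab mode: D^2 / (2 + D^2)."""
    d = 2.0 * math.pi / WAVELENGTH_NM * t_nm * math.sqrt(N_CORE**2 - N_CLAD**2)
    return d * d / (2.0 + d * d)

def thickness_over_gamma(t_nm: float) -> float:
    """Ratio t / Gamma in nm; the text ties the threshold current density to this ratio."""
    return t_nm / confinement_factor(t_nm)

thicknesses_nm = [10, 20, 50, 100, 150, 200, 300, 500]
for t in thicknesses_nm:
    print(f"t = {t:4d} nm   Gamma = {confinement_factor(t):.3f}   "
          f"t/Gamma = {thickness_over_gamma(t):8.1f} nm")

# Thickness in the list that minimizes t / Gamma
best_t_nm = min(thicknesses_nm, key=thickness_over_gamma)
```

Under these assumptions the ratio $t/\Gamma$ passes through a minimum at an intermediate thickness and rises steeply for very thin active layers, which is the qualitative behavior of the threshold current density shown in Fig. 7.14.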



Fig. 7.12 The near-field intensity of laser emission has an angular divergence that depends on the inverse 
thickness of the active region. This is a diffraction effect similar to that which occurs when a photon passes 
through a slit (reproduced from H.C. Casey, Jr. and M.B. Panish, Heterostructure Lasers, Part A, 1978, by 
permission of Elsevier Press) 


7.5.4 Separate Confinement of Photons and Electrons 

In the double heterostructure laser design of the previous section, photons and 
electrons are confined by the same physical structure. Separate confinement of 
photons and electrons is a straightforward way to optimize optical confinement 
and carrier confinement in order to reduce the threshold current density (Chuang 
2010). 






The optical waveguide should be adjusted to give a confinement factor close 
to unity. We have just shown that the optimum waveguide dimension is 
approximately equal to the photon wavelength in the guiding medium. For a 
GaAs/AlGaAs laser this thickness is about 100 nm. 

Equation (7.43) shows that the reduction of the active region thickness 
reduces the threshold current. At the same time, this reduces the overlap of the active 
region with the optical mode. However, it does not change the optical 
confinement factor. If the thickness of the active region and the waveguide 
thickness are reduced together, this gives rise to the dramatic increase in 
threshold current demonstrated in Fig. 7.14. If the thickness of the active region 
is decreased while the waveguide thickness stays constant, then the threshold 
current is increased, to be sure, but only by the ratio of the thickness of the 
optical waveguide region divided by the thickness of the recombination region. 



Fig. 7.13 The optical confinement factor is the fraction of the mode volume that is confined within the 
geometric boundary of the active region of the laser. The confinement factor increases approximately as the 
square of the active region thickness and saturates at unity when the active region thickness is greater than 
the wavelength of the laser emission in the active region 


In this case, we can define an effective confinement factor $\Gamma_{\mathrm{eff}}$: 


(7.44) 

Equation (7.44) shows that this optimization by separate confinement 
produces no major change in the threshold current because improvement by 
lowering the thickness of the active region is balanced by a decrease in the 
overlap of the active region with the optical mode. However, when the active 
region thickness reaches the quantum confinement regime, the physics of photon 
generation changes in a dramatic way. 

The dimension of the active region can be reduced to the quantum regime. 
This means that recombination takes place in a 2-dimensional quantum well. The 
change from 3-d to 2-d is the key to understanding why the quantum well laser 
design can reduce significantly the threshold current density. 

The number of occupied states in the conduction band k-space is the volume 
enclosed by the Fermi surface. In a direct gap semiconductor with parabolic 
bands like GaAs, this is a sphere of radius $k$. The volume, counting 2 states for each value of $k$, is 

$$N(k) = 2 \times \frac{4}{3}\pi k^{3} \qquad (7.45)$$

The energy dependence on k in a direct bandgap semiconducting material 
like GaAs is parabolic near the conduction band edge where photon emission 
occurs. That is: 

And the number of states as a function of energy is expressed as: 

The density of states per unit energy in k-space is therefore: 

Changing from k to real space we compare the volume in k-space to that of 
real space. For simplicity and without any loss of generality, we consider the 
simple cubic crystal structure. The volume in k-space is: 

To convert to real-space dimensions, we divide (7.47) by (7.48) which relates 
real space volume to volume in k-space. The result is the energy density of states 
per unit volume in real space. 
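
The chain of steps just described can be sketched in standard notation. The symbols $m^*$ (effective mass), $\hbar$, $L$ (side of a cubic sample) and $\rho_{3D}$ are assumptions introduced here for illustration, and the sketch counts 2 states for each value of $k$:

```latex
% Sketch of the standard 3-d derivation; m^*, \hbar, L and \rho_{3D} are assumed notation.
\begin{align*}
E &= E_c + \frac{\hbar^2 k^2}{2m^*}
  && \text{(parabolic band)} \\
N(E) &= 2\cdot\frac{4}{3}\pi k^3
      = \frac{8\pi}{3}\left[\frac{2m^*}{\hbar^2}\,(E-E_c)\right]^{3/2}
  && \text{(states inside the sphere)} \\
\frac{dN}{dE} &= 4\pi\left(\frac{2m^*}{\hbar^2}\right)^{3/2}(E-E_c)^{1/2}
  && \text{(per unit energy, in $k$-space)} \\
V_k &= \left(\frac{2\pi}{L}\right)^3
  && \text{($k$-space volume per allowed $k$)} \\
\rho_{3D}(E) &= \frac{1}{L^3}\,\frac{dN/dE}{V_k}
  = \frac{1}{2\pi^2}\left(\frac{2m^*}{\hbar^2}\right)^{3/2}(E-E_c)^{1/2}
  && \text{(per unit energy and volume)}
\end{align*}
```

The square-root dependence on $E - E_c$ in the last line is the feature that matters for the threshold argument: the density of states vanishes at the band edge.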







(7.51) 


The gain factor $\gamma(E)$ is expressed as: 

Equation (7.49) tells us that the density of states is near zero at the 
conduction band edge and rises slowly as a function of photon energy. Thus, 
laser emission cannot occur at the band edge, and additional band filling, that is 
additional pumping current density, is needed to increase the gain function to the 
point where laser action can occur. These features are summarized in Fig. 7.15. 



Fig. 7.14 The threshold current density of a double heterostructure laser has a distinct minimum value 
when the active layer thickness is equal to the laser emission wavelength (reproduced from Amnon Yariv, 
Optical Electronics in Modern Communications, 1996, by permission of Oxford University Press, USA) 


In the case of a 2-dimensional environment, such as that of an electron in a 
quantum well, the same approach yields a different result. There are 2 states for 
each value of k, but the total number of k-states is given by the area of a 
circle having radius k. 

In two dimensions, the procedure is the same, but k-space is now a 2-dimensional 
entity. 







(7.53) 

The number of occupied states is the area in k-space: 

$$N(E) = 2 \times \pi k^{2} = 2\pi\,\frac{2m^{*}}{\hbar^{2}}\,(E - E_c) \qquad (7.54)$$

and the density of states is: 

That is, the density of states is a constant, independent of energy for 
electrons occupying the n = 1 quantum level. For a considerable range of energy, 
the density of states is constant, which means that the gain factor is also constant 
and independent of the drive current of the laser. This situation is advantageous 
for direct current intensity modulation bringing both stability and improvement 
in modulation bandwidth. When the energy of the electron permits occupation of 
the n = 1 and the n = 2 quantum levels, then the density of states doubles, being 
the sum of the densities of the two levels. In this new regime, the gain factor also 
doubles at this step, and then remains constant over a considerable range of 
electron-hole energies. 

Above the quantized energy level of the quantum-confined state, the density 
of states rises like a step-function to a constant value. In a quantum well 
environment, laser action can first occur at the energy of this step in the density 
of states, and this feature leads directly to a significant reduction in threshold 
current density. In Fig. 7.16, we can see how the “dead-zone” is eliminated. 

Fig. 7.15 As current is injected in a 3-d semiconductor laser, the gain function $g(\nu)$ increases 
proportionally to the density of states. Threshold gain is reached when the device starts lasing. This means 
that there is a “dead-zone” where current is injected, photons are emitted, but the round-trip gain is 
insufficient to overcome losses 

Fig. 7.16 In a quantum well structure, laser action can take place for photon energy greater than $E_1$, the 
ground state energy of the quantum well. Current injected into the structure populates the lowest energy 
states at this level where the density of states is already high. As a result threshold gain can be reached as 
soon as there is current present. Because the density of states in a 2-d structure “turns on” in a step-function 
manner, there is no “dead-zone” as in the case of a 3-d laser. This results in a significant reduction in 
threshold current required to initiate laser action 


Equation (7.55) is of course an areal density of states. We can transform this 
expression into a volumetric density of states so that it can be compared with 
(7.49). To do this we divide by the thickness of the quantum well, $L_z$. 

The 2-d density of states is constant everywhere in the quantum well. Thus 
when we integrate the volumetric density of states over the quantum well width, 
we recover (7.52). 
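
The comparison between the bulk and quantum-well densities of states can be made concrete with a short numerical sketch. It uses the standard textbook expressions, $\frac{1}{2\pi^2}(2m^*/\hbar^2)^{3/2}\sqrt{E - E_c}$ for the bulk and $m^*/(\pi\hbar^2 L_z)$ per occupied subband for the well. The effective mass of $0.067\,m_0$, the 10 nm well width and the idealized infinite-well subband energies are assumptions made for illustration only.

```python
import math

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # kg
EV = 1.602176634e-19     # J
M_EFF = 0.067 * M0       # GaAs conduction-band effective mass (assumed value)

def dos_3d(e_above_edge_ev: float) -> float:
    """Bulk density of states per unit energy and volume, in eV^-1 cm^-3."""
    if e_above_edge_ev <= 0.0:
        return 0.0
    e_joule = e_above_edge_ev * EV
    per_joule_m3 = (2.0 * M_EFF / HBAR**2) ** 1.5 * math.sqrt(e_joule) / (2.0 * math.pi**2)
    return per_joule_m3 * EV * 1e-6

def dos_2d_volumetric(e_above_edge_ev: float, well_nm: float, levels_ev: list[float]) -> float:
    """Quantum-well density of states divided by the well width, in eV^-1 cm^-3."""
    step_per_joule_m2 = M_EFF / (math.pi * HBAR**2)   # one constant step per subband
    occupied = sum(1 for e_n in levels_ev if e_above_edge_ev >= e_n)
    return occupied * step_per_joule_m2 / (well_nm * 1e-9) * EV * 1e-6

def infinite_well_levels(well_nm: float, n_max: int = 2) -> list[float]:
    """Subband energies in eV for an idealized, infinitely deep well (an assumption)."""
    e1 = (HBAR * math.pi / (well_nm * 1e-9)) ** 2 / (2.0 * M_EFF) / EV
    return [e1 * n * n for n in range(1, n_max + 1)]

WELL_NM = 10.0
levels = infinite_well_levels(WELL_NM)
for e in (0.01, levels[0], 0.1, levels[1]):
    print(f"E - Ec = {e:.3f} eV   3-d: {dos_3d(e):.2e}   "
          f"2-d: {dos_2d_volumetric(e, WELL_NM, levels):.2e}  (eV^-1 cm^-3)")
```

In this idealized model the well has no states below its first subband, then jumps directly to the value the bulk curve reaches at the same energy, and doubles when the second subband becomes available. That step is the origin of the missing “dead-zone”.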


7.5.5 The Quantum Well Laser 

The separate confinement quantum-well laser takes advantage of the 
dramatically lower threshold current provided by the 2-d nature of electrons 
confined in a quantum well. Typical well dimensions that provide this 
confinement are ~10 nm for a GaAs/Al$_{0.30}$Ga$_{0.70}$As heterostructure. Quantum 
well dimensions larger than this will not assure the 2-d behavior of the density of 
states. Quantum well dimensions smaller than this result in the n = 1 energy level 
lying so near in energy to the top of the well that electrons are no longer 
well-confined (so to speak!). 

There are two main approaches to remedy this result. One is to grade the 
index in the photon waveguide so that the optical field intensity is maximized in 
the quantum well region. This structure is named graded-index separate 
confinement heterostructure, or GRINSCH (Fig. 7.17). 

Fig. 7.17 GRINSCH design. The graded-index design creates a self-focussing waveguide that enhances the 
optical field strength at the center of the waveguide where the quantum well(s) active region is located. 
This boosts stimulated emission. In this structure, the effective confinement factor depends on the simple 
ratio of the active region thickness to the width of the optical mode 


The second approach is to insert several quantum wells across the width of 
the optical waveguide. This is called a multi-quantum well laser. 

Laser engineering using these approaches has resulted in quantum well lasers 
that have a threshold current an order of magnitude lower than that which 
obtains for a double heterostructure design. 


7.6 Summary 

A laser is an amplifier with positive feedback. Amplification is generated by 
stimulated emission of photons, and positive feedback is achieved using mirrors. 

Quantum photonics concerns the exchange of energy between electrons and the 
excitations of the electromagnetic field. The physics of spontaneous and 
stimulated emission of photons is directly seen through quantization of the 
electromagnetic fields. This physics is the foundation of laser action. An electron 
can be promoted from a lower to higher energy state provided that a photon 
having the corresponding energy difference is annihilated. This is stimulated 
absorption. An electron can transition from a higher to lower energy state by 
emitting a photon having the corresponding energy difference. This transition 
can be stimulated by the presence of another photon, or the transition can occur 
spontaneously in the absence of a photon. 

The transition rate for stimulated emission depends on the number of 
photons already present in the electromagnetic mode. The stimulated photon is 
emitted into the same mode. This stimulated emission acts like a linear amplifier 
that duplicates all the properties that define the mode of the stimulating photon: 
frequency, polarization and phase. The matrix element for stimulated emission is 
the same as that for stimulated absorption. 

Spontaneous emission can only be understood as a quantum reaction with the 
vacuum. The emitted photon emerges from the vacuum state with an energy that 
corresponds to the energy difference between the initial and final states of the 
electron. Photons created by spontaneous emission act as seeds for subsequent 
stimulated optical transitions. Under conditions of thermal equilibrium, the 
spontaneous emission rate greatly exceeds the rate of stimulated emission. On 
the other hand, population inversion, which is a highly non-equilibrium state, is 
a necessary condition to raise the stimulated emission rate above the rate of 
spontaneous emission. This leads to laser action. 

In steady-state laser operation, the absorption rate of photons is equal to the 
emission rate. The emission rate is composed of both spontaneous and 
stimulated emission. The absorption rate is due only to stimulated absorption. 
Optical gain will occur if the stimulated emission rate exceeds the stimulated 
absorption rate. Such a situation cannot occur in a two level system because the 
same two levels are responsible both for absorption and emission of photons. In 
a three or four level system, the equilibrium between absorption and emission is 
maintained, but the absorption takes place between one set of levels and the 
emission takes place between a different set. A semiconductor laser is an 
example of a four-level system. Putting a forward current through the diode 
causes recombination to occur, generating photons. Some of these photons will 
be emitted into the resonant modes of the cavity created by the mirrors. Only 
these photons will be amplified by stimulated emission. Their properties are the 
same as those of the photons that stimulated the emission. As the current is 
increased, these amplified modes will account for a greater percentage of the 
total recombination. Threshold is reached when the amplification per round trip 
in the cavity exceeds the absorption and scattering losses for the same round trip. 

Semiconductor lasers were first demonstrated by engineering stimulated 
emission generated by electrons that make a direct transition between occupied 
states in the conduction band and empty states in the valence band. This was 
soon extended to direct transitions between any two states where a population 
inversion could be engineered (Unterrainer et al. 1990). An outstanding example 
of this approach is the quantum cascade laser, which is discussed in the 
following chapter. 

A critical parameter for laser performance is the threshold current density. 
The development of the double-heterostructure laser concept by Alferov and 
Kroemer is based on confinement of the optical mode and the recombination of 
electrons and holes to the same region of space. This idea paved the way to 
room-temperature lasers and was awarded the year-2000 Nobel prize. 
Application of quantum confinement to two dimensions has dramatically 
improved this design by separate optimization of the optical waveguiding region 
and the electron-hole recombination region. This results in a steep reduction in 
the threshold current and a gain factor that is independent of operating drive 
current. 


Exercises 

1. Laser action can occur when the stimulated emission rate exceeds the 
spontaneous emission rate (see (7.37)). What would happen if you reduced 
the spontaneous emission rate to zero? Would you have a threshold-less 
laser? Explain your answer. 

2. Calculate the ratio of the spontaneous to stimulated emission rates at thermal 
equilibrium at T = 300 K, for an electron transition of 0.2 eV (infrared) and of 4 eV 
(near UV). 

3. Estimate the threshold current density in A cm$^{-2}$ of a GaAs-based laser with 
the following properties: 

Emission wavelength = 850 nm 

Linewidth of the gain spectrum = 1.5 × $10^{13}$ Hz 

Internal losses = 30 cm$^{-1}$ 

Index of refraction = 3.5 

Cavity length = 400 µm 

Thickness of the recombination region = 200 nm 

4. Compare the threshold of a GaN laser to that of a GaAs laser. What can be changed? 

Abstract 

The quantum cascade laser represents an important accomplishment in quantum 
photonics, combining quantum excitation of the electromagnetic field, electron 
transport and controllable quantum mechanical tunneling all working 
cooperatively in a single device. In cascade lasers, the carriers are “recycled” 
from one stage to the next, so that the evacuation region of stage 1 is connected 
to the injection region of stage 2. A single carrier traveling through the structure 
may emit a photon at each stage. The successful operation of the quantum 
cascade laser is a balancing act between population inversion maintained by 
resonant phonon evacuation of carriers from the lower level and non-radiative 
depletion of the upper level population caused by optical phonon scattering. The 
interband cascade laser eliminates this major difficulty of the quantum cascade 
laser, by using a conduction band to valence band optical transition to generate 
the laser emission. As a result, the threshold power density in the interband 
cascade laser is lower by an order of magnitude compared to the quantum 
cascade laser. 


8.1 Introduction 

The quantum cascade laser, like its cousin, the interband cascade laser, unites 
purely quantum behavior like tunneling with photonics and electronics, creating 
a kind of concerto for quantum photonics. The quantum cascade laser (QCL) 
and the interband cascade laser (ICL) are commercially available components, 
and they demonstrate that active quantum effects like tunneling can be used in a 
controllable way at room temperature with macroscopic effects. 

These devices are based on classic laser principles: a gain region 
implemented by population inversion, inside an optical resonator that creates 
positive feedback for optical frequencies within the gain spectrum. In addition 
there is a carrier injection region and a collector evacuation region. Laser action 
requires that the optical recombination lifetime be longer than the pumping and 
evacuation lifetimes, enabling population inversion. 

In cascade lasers, the carriers are “recycled” from one stage to the next, so 
that the evacuation region of stage 1 is connected to the injection region of stage 
2. A single carrier traveling through the structure may emit a photon at each 
stage. Thus the same carrier may emit several photons as it traverses the device 
structure. Unlike that of a p-n junction laser, the quantum efficiency of a cascade laser is in 
general much greater than unity. As always, energy is conserved. A better figure 
of merit is the “wall plug” efficiency which is the ratio of the emitted optical 
energy to the electrical energy consumed. The wall plug efficiency is always less 
than unity. 

The cascade laser represents an important accomplishment in quantum 
photonics, combining quantum excitation of the electromagnetic field, electron 
transport and controllable quantum mechanical tunneling all working 
cooperatively in a single device. 

There are two varieties of this device. In the interband cascade laser (ICL), 
photons and laser action are created by electron-hole interband recombination. 
Laser action in the ICL follows the analysis presented in Chap. 7. The cascade 
structure enables the construction in series of multiple optical gain sections, but 
does not play an important role in the active regions of the device. In the 
intersubband cascade laser (QCL), photons and laser action are created by 
transitions between quantum well states in the conduction band. The 
intersubband quantum cascade laser is a unipolar device. In this chapter we will 
consider primarily the intersubband quantum cascade laser. 

In Fig. 8.1a we show a schematic diagram of potential energy versus distance 
for a series of quantum wells. In Fig. 8.1b we show how this potential 
distribution changes under an applied electric field E. Note that for a specific 
value of the applied electric field, the $n_1$ level of the jth well can be made 
degenerate with respect to the $n_2$ level of the (j + 1)th quantum well. This situation 
enables resonant tunneling to occur between the jth and (j + 1)th wells. 
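
The resonance condition can be estimated with a short calculation. The sketch assumes a uniform field, so that the potential energy drops by $qEa$ over one superlattice period $a$, and it neglects any shift of the levels caused by the field itself. The subband spacing of 0.15 eV and the period of 10 nm are hypothetical values chosen only for illustration.

```python
def level_offset_ev(delta_ev: float, period_nm: float, field_v_per_cm: float) -> float:
    """Energy of n2 in well j+1 minus energy of n1 in well j, in eV.

    delta_ev is the n1-n2 spacing within one well; the field lowers the
    next well by (field * period) electron-volts for a unit charge.
    """
    period_cm = period_nm * 1e-7
    return delta_ev - field_v_per_cm * period_cm

def alignment_field_v_per_cm(delta_ev: float, period_nm: float) -> float:
    """Field at which the drop over one period equals the subband spacing."""
    period_cm = period_nm * 1e-7
    return delta_ev / period_cm

DELTA_EV = 0.15      # hypothetical subband spacing
PERIOD_NM = 10.0     # hypothetical superlattice period

field = alignment_field_v_per_cm(DELTA_EV, PERIOD_NM)
print(f"alignment field = {field:.2e} V/cm")
print(f"offset at that field = {level_offset_ev(DELTA_EV, PERIOD_NM, field):.1e} eV")
```

Fields below this value leave the $n_2$ level of the next well above the $n_1$ level of the current well. At the alignment field the offset vanishes and resonant tunneling becomes possible.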




Fig. 8.1 Schematic diagram of a superlattice potential in the direction perpendicular to the plane of the 
layers forming the superlattice. a Potential distribution as a function of distance. The periodicity of the 
superlattice is $a$. The energy separation between the subbands $n_1$ and $n_2$ in the same quantum well is 
$\Delta$. b In the presence of an electric field $E$, the bands are tilted by the applied voltage. The 
difference in energy between the subband $n_1$ and $n_2$ in the adjacent quantum well is given by $\Delta - qEa$ 


The quantum cascade laser uses voltage-tunable tunneling to transfer 
electrons from the ground state quantum level in a quantum well to a higher 
quantum level in the adjacent quantum well. This electron can then relax to a 
lower quantum energy level by emitting a photon. And, it can then be injected by 
tunneling into the next stage, allowing the above procedure to repeat for as many 
such stages as make up the structure. 

Shortly after the announcement by Hayashi et al. (1970) of a semiconductor 
laser operating continuously at room temperature, Kazarinov and Suris in 1971 
proposed an optical amplifier (laser) based on photon-assisted tunneling between 
levels in a multi-quantum well structure. The core of their idea was to exploit 
electron tunneling from a level in one quantum well into a virtual state in the 
forbidden region of an adjacent quantum well, driven by an applied bias voltage. 
In the absence of an external photon flux, the tunneling probability is near zero. 
They postulated that the presence of photons having an energy $\varepsilon$ equal to the 
difference between the energy of the virtual state and that of the next lowest 
quantum well state would enhance the tunneling probability and stimulate the 
emission of an additional photon by laser action. In Fig. 8.2, we show a 
schematic diagram of this proposal. 



Fig. 8.2 In the proposal of Kazarinov and Suris of 1971, an electron would tunnel from a stable quantum 
well state into a virtual state in the adjoining well. In the presence of a photon the energy of which is equal 
to $\varepsilon$, the electron transitions to a final allowed state in the adjoining well by stimulated emission of a 
second photon of energy $\varepsilon$. The energy $\varepsilon$ is determined by the applied voltage, and so the proposed 
photon amplifier would be tunable in energy 

While a device based on the proposal of Kazarinov and Suris has yet to be 
demonstrated, it does contain important elements of working quantum cascade 
lasers. 

In Fig. 8.3 we show a typical structure for a quantum cascade laser (Sirtori et 
al. 1996). As shown in the figure we can separate the structure into 3 regions: 
carrier injection, photon emission, and carrier evacuation. We will consider each 
of these regions separately. The prime requirement for a laser is the generation of 
stimulated emission. Stimulated emission requires population inversion between 
the energy levels where photons are generated in the photon emission region. 

Fig. 8.3 Diagram of a QCL showing the injector region, emission and evacuation regions. In this structure 
we show emission of a photon via an intersubband transition within the same quantum well. This is one 
among a wide variety of structures in which QCL operation has been demonstrated 


8.2 Requirements for an Intersubband Laser 

Intersubband optical transitions were discovered and characterized by Gornik 
and Tsui (1976). A laser based on intersubband optical transitions is above all a 
laser, and the basic laser requirements still apply: 

• Population inversion: 

Laser action requires a population inversion between two levels. In Chap. 7 
we proved that this requires at least 3 energy levels for electron excitation, 
photon emission and electron evacuation. The lifetimes of the transitions 
between these states must assure a population inversion. 

We will label as $\tau_1$ the lifetime for carriers to resupply the upper level for 
emission. The lifetime for photon emission will be labelled $\tau_2$ and the scattering 
time to evacuate electrons from the lower level will be labelled $\tau_3$. A basic 
requirement for laser action is that the lifetime for photon emission be long 
enough to ensure that the upper level is populated and that the lower level is 
depopulated (refer to (7.31)). The lifetime for photon emission in the photon 
generation region must also be less than the phonon scattering time, in order to 
minimize dissipation of the electron energy in the upper state by non-radiative 
inelastic scattering (Faist et al. 1994). 










































$$\tau_1 < \tau_2 \quad \text{and} \quad \tau_3 < \tau_2 \qquad (8.1)$$

• Resonator 

Emitted photons must be re-circulated in the active region in order to 
stimulate further emission. The resonator can be formed by the usual cleaving 
process to provide two parallel reflecting faces, or by a distributed feedback 
grating, as in a conventional semiconductor laser. 

• Gain is greater than round-trip losses 

Laser action occurs when photon gain exceeds the round-trip losses, 
typically due to waveguide imperfections and optical phonon scattering. 

In the following sections we will see how the design of a working quantum 
cascade laser satisfies the criteria for laser action. 
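
The role of the lifetimes in the population inversion requirement can be illustrated with a minimal steady-state model. This is a sketch under simplifying assumptions, not the analysis of Chap. 7: the upper level is filled at a constant pump rate and empties with the emission lifetime, the lower level is filled only from the upper level and empties with the evacuation lifetime, and stimulated emission is neglected (below threshold). The resupply lifetime is not modelled separately, being absorbed into the pump rate. The picosecond-scale values are hypothetical.

```python
def steady_state_populations(pump_rate: float, tau_emission: float,
                             tau_evacuation: float) -> tuple[float, float]:
    """Steady-state populations (upper, lower) of the two laser levels.

    Assumed rate equations:
        dN_upper/dt = pump_rate - N_upper / tau_emission
        dN_lower/dt = N_upper / tau_emission - N_lower / tau_evacuation
    """
    n_upper = pump_rate * tau_emission
    n_lower = pump_rate * tau_evacuation
    return n_upper, n_lower

PUMP_RATE = 1.0e12   # carriers per second (hypothetical)

# Evacuation faster than emission: inversion
n_up, n_low = steady_state_populations(PUMP_RATE, tau_emission=1.0e-12, tau_evacuation=0.3e-12)
print(f"fast evacuation: N_upper / N_lower = {n_up / n_low:.2f}")

# Evacuation slower than emission: no inversion
m_up, m_low = steady_state_populations(PUMP_RATE, tau_emission=1.0e-12, tau_evacuation=3.0e-12)
print(f"slow evacuation: N_upper / N_lower = {m_up / m_low:.2f}")
```

In this model the ratio of upper to lower population is simply the ratio of the emission lifetime to the evacuation lifetime, so inversion requires the evacuation to be the faster process.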


8.3 Injector Region 

The injector region consists of a series of quantum wells containing a reservoir 
of electrons, formed by heavy doping (> $10^{12}$ cm$^{-2}$). The reservoir is 
connected to the active region and to the evacuation region by a series of 
quantum wells. The structure shown in Fig. 8.3 is based on a working design, 
consisting of 9 quantum wells with dimensions between 1.7 and 3.5 nm. (The 
complete specifications of this structure are given by Jerome Faist (2013), 
Appendix A.) In the structure under consideration, the total width of this region is 
about 30 nm. 

We recall from Chap. 3 that sequential quantum wells have resonant levels 
that form energy minibands with finite width rather than sharp levels, as shown 
in Fig. 8.4. Such minibands facilitate transport by tunneling for two reasons. The 
first is that the range of voltage for which efficient transfer occurs is broader, 
which favors stable transport of electrons. The second reason is related to the 
wavefunction of the electron. 

In the calculations of free-electron tunneling in Chap. 3, we assumed that the 
electron wavefunction has a single, well-defined k-vector. This assumption is 
needed to calculate accurately the energy levels in quantum wells (Harrison 
2006). However, the wavefunction of real electrons is localised in both space and 
time, and it is expressed as a wave packet, a superposition of free-electron 
wavefunctions: 

$$\Psi(k, r) = \sum_k A_k\, e^{ikr} \qquad (8.2)$$

When an electron represented by such a wavepacket is incident on a 
sequence of quantum wells, it will be transmitted or reflected depending on the 
relationship of each of its component wavevectors with the transmission 
coefficient of the quantum well structure. It is obvious that a quantum well 
structure with a miniband structure will transmit a wavepacket more completely 
than that of a single resonant quantum well (refer to Fig. 3.17) (Fig. 8.4). 



Fig. 8.4 The transmission coefficient for a sequence of 10 quantum wells having well width = barrier 
width = 2 nm. The well material is Ga$_{0.47}$In$_{0.53}$As and the barrier material is Al$_{0.48}$In$_{0.52}$As. The energy 
level structure is a miniband having a width of ~0.3 eV 
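
The transmission of a sequence of wells and barriers can be explored numerically with a transfer-matrix calculation. The following is only a minimal sketch under stated assumptions: a flat-band structure (no applied field), a single effective mass of 0.043 m0 used in every layer, and a barrier height of 0.52 eV. None of these three values is given in the text, and the function names are illustrative.

```python
import cmath
import math

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # free-electron mass, kg
EV = 1.602176634e-19     # J per eV

def layer_matrix(energy_eV, potential_eV, width_m, mass_kg):
    """Matrix that carries (psi, dpsi/dz) across one layer of constant potential."""
    # k is real in a well and imaginary inside a barrier (energy < potential).
    k = cmath.sqrt(2.0 * mass_kg * (energy_eV - potential_eV) * EV) / HBAR
    kd = k * width_m
    return ((cmath.cos(kd), cmath.sin(kd) / k),
            (-k * cmath.sin(kd), cmath.cos(kd)))

def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return ((a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
            (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]))

def transmission(energy_eV, n_wells, well_m, barrier_m, barrier_eV, mass_kg):
    """Transmission probability through n_wells wells separated by barriers.

    The structure is barrier, (well, barrier) * n_wells, with identical
    zero-potential leads on both sides. energy_eV must be positive and
    must not coincide exactly with barrier_eV.
    """
    layers = [(barrier_eV, barrier_m)]
    for _ in range(n_wells):
        layers += [(0.0, well_m), (barrier_eV, barrier_m)]
    total = ((1.0, 0.0), (0.0, 1.0))
    for potential, width in layers:
        total = matmul(layer_matrix(energy_eV, potential, width, mass_kg), total)
    k0 = math.sqrt(2.0 * mass_kg * energy_eV * EV) / HBAR
    denom = total[0][0] + total[1][1] - 1j * (k0 * total[0][1] - total[1][0] / k0)
    return 4.0 / abs(denom) ** 2

mass = 0.043 * M0   # assumed effective mass, used for wells and barriers alike
for i in range(1, 26):
    energy = 0.02 * i
    print(f"{energy:.2f} eV  T = {transmission(energy, 10, 2e-9, 2e-9, 0.52, mass):.3e}")
```

Scanning the energy on a finer grid shows bands of energy where the transmission approaches unity, separated by regions where it is strongly suppressed. The exact position and width of these bands depend on the assumed mass and barrier height.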


When the bias voltage aligns the levels so that electrons in the reservoir 
region can tunnel into the photon region, electrons are driven into the photon 
generation region by carrier diffusion , the direct result of the carrier 
concentration gradient between the two regions. When a carrier disappears due 
to a photon generation transition, it is resupplied by diffusion. This time is 
estimated from the diffusion relation: 


$$\tau_1 = \frac{L^2}{2D} = \frac{qL^2}{2\mu_e k_B T} \qquad (8.3)$$


(the factor 2 is appropriate for 1-dimensional diffusion). 

The diffusion time depends on the square of the distance, and for the rather small 
distances appropriate for quantum wells, this time can be very short. For 
example, using parameters appropriate for electrons in GaAs: 

$\mu_e \approx 5000$ cm$^2$ V$^{-1}$ s$^{-1}$ 
$k_B T = 0.026$ eV 
$L = 50$ nm 

we can estimate: 

$$\tau_1 \approx 10^{-13}\ \text{s} \qquad (8.4)$$
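
The estimate in (8.4) can be reproduced with a few lines of arithmetic. This is a sketch that uses only the GaAs values quoted above and the Einstein relation D = μ k_B T / q; the function and variable names are illustrative.

```python
def diffusion_time(length_m, mobility_m2_per_Vs, thermal_voltage_V):
    """Return the 1-D diffusion time L^2 / (2 D), with D from the Einstein relation."""
    diffusivity = mobility_m2_per_Vs * thermal_voltage_V  # D = mu * k_B T / q, in m^2/s
    return length_m ** 2 / (2.0 * diffusivity)

mu_e = 5000e-4      # 5000 cm^2 V^-1 s^-1 expressed in m^2 V^-1 s^-1
kT_over_q = 0.026   # k_B T / q at room temperature, in volts
L = 50e-9           # 50 nm

tau_1 = diffusion_time(L, mu_e, kT_over_q)
print(f"tau_1 = {tau_1:.2e} s")
```

Because the time scales as the square of the distance, halving L shortens the diffusion time by a factor of four.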


8.4 Electron Evacuation from the Lower Level 

Before considering the photon generation region, we will treat the evacuation of 
electrons from the n = 1 level of the photon generation region. In order to ensure 
population inversion, this level must be empty of carriers relative to the 
population of the n = 2 level. Electron-phonon scattering is an efficient 
mechanism to transfer electrons out of this lower level, provided that there is a 
suitable final state for them to scatter to. This is provided by a second quantum 
well adjacent to the photon generation region. 

Phonons are mechanical vibrations of the crystal lattice. In a semiconductor 
like GaAs with two atoms per unit cell, there are two basic phonon types: 
acoustic and optical modes. Acoustic phonons correspond to large scale 
compressive or shear waves. Adjacent atoms are displaced in the same direction, 
and there is little relative movement between nearest neighbors. 

Optical phonon excitation describes vibrations where nearest neighbors are 
displaced opposite to each other and their relative motion is considerable. 

Optical phonons have polarization. Transverse optical phonons have 
displacement amplitude that is perpendicular to the direction of propagation. 
Longitudinal optical phonons have displacement in the direction of propagation. 
The two ions making up the unit cell of III-V semiconductors carry different net 
charge, and the relative motion of these ions creates an electric displacement 
field that interacts strongly with electrons. On the other hand, acoustic phonon 
excitation does not create as significant a displacement field, and their 
interaction with electrons is less significant. 

Electron evacuation is achieved by inelastic (non-reversible) scattering of the 
electron from its state following photon emission to another band lying lower in 
energy. Scattering by optical phonons is an efficient way to achieve this 
objective because the phonon scattering rate is an order of magnitude larger than 
the photon transition rate. Thus an electron that makes an intersubband optical 
transition can be scattered out of the lower energy subband at a rate that far 
exceeds the arrival rate of additional electrons, preventing population build-up in 
this state. Scattering by optical phonons is accompanied by energy loss. The 
optical phonon energy is transferred to the crystal lattice as heat. Under most 
conditions, this energy is large enough to ensure that the scattered electron 
remains in the lower energy band, but small enough, relative to the intersubband 
photon energy, to have only a moderate effect on device efficiency. 

Electron-phonon scattering is mediated by the interaction between the 
electron and the piezo-electric potential created by local lattice deformation. The 
Hamiltonian for this interaction is written (Ziman 1967): 

$$\mathcal{H}_{e\text{-}p} = -\int \mathrm{Disp}(\mathbf{r}) \cdot \mathrm{Polar}(\mathbf{r})\, d\mathbf{r} \qquad (8.5)$$

Equation (8.5) makes it clear that only phonons having a polarization aligned 
with the electron displacement vector will contribute to scattering. These are the 
longitudinal optical phonons. The transverse phonons are polarized 
perpendicular to the displacement. These two features, a single polarization and 
the constant energy of the optical phonon as a function of wavevector, greatly 
simplify the treatment of optical phonon scattering. 

An optical phonon can be approximated as an excitation having a finite 
constant energy for a wide range of wavevectors from Q = 0 to the maximum 
defined by the unit cell boundaries. 

This means that highly efficient electron-phonon scattering out of the $n_2$ 
level into the $n_1$ level can take place with a constant energy shift and 
conservation of momentum ($\mathbf{k}_i - \mathbf{k}_f = \mathbf{Q}_{op}$, where $\mathbf{k}$ is the wavevector of the 
electron). To appreciate this, we recall that a quantum well is a 3-dimensional 
object. Although there is strong confinement in the z-direction, the electron can 
move freely in the 2-dimensional space orthogonal to the direction of strong 
confinement. In Fig. 8.5 we show a schematic diagram of the energy-momentum 
relationship in the x-y plane. An electron at the lowest energy state, i.e., 
$k_x = 0$, $k_y = 0$, in the upper band can change its state in one of three ways. 

1. Intrasubband phonon scattering: The electron scatters to another state in the 
upper subband by absorbing an optical phonon and gaining the optical 
phonon energy = $\hbar\omega_{op}$. 

2. Direct intersubband optical transitions: The electron can make a direct 
optical transition to the lower subband emitting a photon with energy = $\Delta$. 

3. Intersubband phonon scattering: The electron scatters to another state in the 
lower subband by emitting an optical phonon and giving up the optical 
phonon energy = $\hbar\omega_{op}$. 

Fig. 8.5 If the energy spacing $\Delta$ between the $n_2$ and $n_1$ levels is at least as large as the optical phonon 
energy, then highly efficient electron-phonon scattering can be harnessed to evacuate electrons from the $n_2$ 
to the $n_1$ level 

It is known from measurements that the optical phonon scattering time 
($\sim 10^{-10}$ s) is an order of magnitude shorter than the photon recombination time 
($10^{-9}$ s). So, phonon scattering dominates. 

The relative probability of phonon absorption to phonon emission can be 
estimated using Boltzmann statistics: 

$$\frac{N_{absorption}}{N_{emission}} = e^{-\hbar\omega_{op}/k_B T} \qquad (8.6)$$

For example, in the case of GaAs, $\hbar\omega_{op} = 0.029$ eV. At room temperature 
($k_B T = 0.026$ eV) this ratio is $e^{-0.029/0.026} \approx 0.33$, so we estimate 
intersubband scattering by phonon emission to be about 3 times more likely than 
intrasubband scattering by phonon absorption. 

In Fig. 8.5 it is also apparent that an electron can scatter from the upper 
subband to the lower subband by phonon emission only if $\Delta > \hbar\omega_{op}$. This puts a 
critical constraint on the design of the quantum well dimension: 

$$E_{n_2} - E_{n_1} > \hbar\omega_{op} \qquad (8.7)$$

The phase space for the final state is described by a circle of states lying on 
the energy surface of the lower subband. The magnitude of the momentum 
transfer is given by (see Fig. 8.6a): 

$$|\mathbf{Q}|^2 = k_i^2 + k_f^2 - 2 k_i k_f \cos\theta \qquad (8.8)$$

For example, the optical phonons in GaAs have an energy of about 0.029 eV. 
An electron in the lowest energy state in the $n_2$ quantum level can transition 
to any one of a number of states in the $n_1$ level, lying lower in energy by 
0.029 eV (Fig. 8.6b). 

Fig. 8.6 Electron phonon intersubband scattering. The phase-space for optical phonon emission 
consists of a 2-dimensional locus of points (shown schematically as a circle) in the plane parallel to the 
quantum wells. a Scattering angle between the initial and final states. b Relationship between the optical 
phonon energy $\hbar\omega_{op}$ and the intersubband energy difference $\Delta$ 

Electron-phonon scattering is the result of the electric field displacement 
vector of the electron interacting with the polarization of the phonon vibrational 
mode. 

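Phonon excitations in a crystal can be described just like photons as an 
ensemble of harmonic oscillators. The Hamiltonian can therefore be written 
using creation and annihilation operators as: 

$$\mathcal{H} = \sum_{Q} \hbar\omega_Q \left(a_Q^{\dagger} a_Q + \tfrac{1}{2}\right) \qquad (8.9)$$

The phonon energy spectrum can be written down immediately as 

$$E = \sum_{Q} \hbar\omega_Q \left(n_Q + \tfrac{1}{2}\right) \qquad (8.10)$$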

Electron-phonon scattering is mediated by the interaction between the 
electron and the piezo-electric potential created by local lattice deformation. 

The matrix element for the interaction can be written in the usual way: 

M = (8.11) 

Using normal coordinates and raising lowering operators, this can be written 

At = < n d J ^ r A (r - \n Q + 1) (8.12) 

where A ( r ~ 0 gives the displacement from equilibrium, 1 being the 
equilibrium coordinate, and ' is the wave associated with the phonon. 
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riQ + 1 




\2NVm*h(ij} 

within a reciprocal lattice vector. 1 (*'*') is a form factor: 


(8.13) 

(8.14) 


I (k. k f ) = - p, rp (k-k) J ^m k dr ( 8 . 15 ) 

and is zero for transverse polarizations. So only longitudinal phonons, where 
the polarization is aligned with displacement, interact strongly with electrons. 

The electron-phonon scattering rate can be expressed using Fermi’s golden 
rule : 

77. = T Z l ' w “' 1 ,Pi >| 2,s ' ( £ - E (*i) ~ “) 

" l- 
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(8.16) 



(8.17) 



where, 

J_l_ 

Now we can see that resonant scattering (that is, $Q \to 0$) is the key (Fig. 8.7). 

To favor population inversion and stimulated emission, we want to maximize 
the scattering rate out of the upper subband to the lower subband, and this will 
occur as $Q \to 0$; that is, when the energy separation between the two bands is 
equal to the optical phonon energy. 

In the quantum cascade laser, the condition for resonant phonon scattering is 
the key to efficient evacuation of electrons from the n = 1 level. If this condition 
is implemented in the same quantum well where photon generation occurs, it 
will put an unwanted restriction on the energy of the upper level. Thus the final 
state for electron-phonon scattering should be located in a quantum well 
spatially adjacent to that in which the photon generation is achieved, as shown in 
Fig. 8.3 (Beck et al. 2002). An alternative is to place the upper level in a separate 
quantum well “upstream” from the well containing the lower energy state for 
photon generation. Then resonant phonon scattering to evacuate this state can be 
incorporated in the same well as the lower energy state. Successful operation of 
QCLs has been demonstrated using both structures. 

Fig. 8.7 Resonant electron-phonon intersubband scattering occurs when the momentum transfer between 
the initial and final state $|\mathbf{Q}| = |\mathbf{k}_i - \mathbf{k}_f| \approx 0$. In this regime, the scattering rate is significantly increased 

The energy alignment of the n = 1 band in well 1 with the n' = 1 level in well 
2 is diagrammed in Fig. 8.8. The energy difference depends directly on the 
applied electric field and the physical separation between the wells. As the 
applied electric field is increased, the energy difference between level n = 1 in 
well 1 and n' = 1 in well 2 increases in proportion. When this energy difference 
is equal to the LO phonon energy (0.029 eV for GaAs and 0.034 eV for 
Ga$_{0.47}$In$_{0.53}$As), we can expect that any carriers in the n = 1 level will be 
transferred efficiently to the n' = 1 level in well 2. This situation creates, as a 
result, a population inversion between the n = 2 and the n = 1 levels in well 1. 

Fig. 8.8 Alignment of quantum levels in the photon generation region. a At 0 applied bias, quantum 
confinement causes the n' = 1 level to lie 0.066 eV higher than the n = 1 level in well 1. b In the presence of 
an applied electric field of $1.2 \times 10^{5}$ V cm$^{-1}$, the n' = 1 level now lies below the n = 1 level by 0.034 eV. 
This design enables independent choice of the energy separation between the n = 2 and n = 1 states, 
thus fixing the photon emission energy. It also enables tuning the energy difference between the n = 1 and 
the n' = 1 states to achieve resonant phonon evacuation of the n = 1 level 























Example: 

Estimate the electric field needed to align the levels n = 1 and n' = 1 for the 
structure shown in Fig. 8.3. 

In this structure composed of Al$_{0.48}$In$_{0.52}$As and Ga$_{0.47}$In$_{0.53}$As, the distance 
between the center of well 1 and well 2 is 8 nm. In well 1, the n = 1 level occurs 
at 0.10 eV above the bottom of the well. In well 2, the n' = 1 level occurs at 
0.166 eV above the bottom of the well. At 0 applied field, the energy difference 
is $\Delta E = 0.10 - 0.166 = -0.066$ eV. By applying an external bias such that the potential 
difference between the two wells is 0.10 eV, we see that the n = 1 level now lies 
higher than the n' = 1 level by 0.034 eV, which is the energy of the LO phonon. 

The applied field is $\mathcal{E} = 0.10\ \text{V} / (8 \times 10^{-7}\ \text{cm}) \approx 1.2 \times 10^{5}$ V cm$^{-1}$. We note that this field is close 
to the electric field that will cause breakdown by direct electron tunneling from 
the valence to conduction band in Ga$_{0.47}$In$_{0.53}$As, about $2.5 \times 10^{5}$ V cm$^{-1}$. 

The applied voltage per section would be $V = 1.2 \times 10^{7}$ V m$^{-1}$ $\times$ $5 \times 10^{-8}$ 
m = 0.6 V per section. 
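
The arithmetic of this example can be checked with a short script. It is a sketch that uses only the numbers quoted in the example; the variable names are illustrative.

```python
E1_well1 = 0.100      # n = 1 level in well 1, eV above the well bottom
E1_well2 = 0.166      # n' = 1 level in well 2, eV above the well bottom
E_LO = 0.034          # LO phonon energy in Ga0.47In0.53As, eV
separation_m = 8e-9   # centre-to-centre distance between the two wells
section_m = 5e-8      # length of one cascade section, as used in the text

# Potential drop between the wells (in volts, for one electron charge) that
# places the n = 1 level one LO phonon energy above the n' = 1 level.
drop_V = (E1_well2 - E1_well1) + E_LO
field_V_per_m = drop_V / separation_m
field_V_per_cm = field_V_per_m / 100.0
voltage_per_section = field_V_per_m * section_m

print(f"potential drop      = {drop_V:.3f} V")
print(f"field               = {field_V_per_cm:.2e} V/cm")
print(f"voltage per section = {voltage_per_section:.2f} V")
```

Carrying the unrounded field through gives a voltage per section slightly above the 0.6 V obtained in the text, which rounds the field to 1.2 x 10^7 V/m before multiplying.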

In the particular structure of the example above, we estimate that a bias 
voltage of 0.6 V per quantum cascade section is needed to create the conditions 
for resonant LO phonon scattering of electrons out of the n = 1 state in well 1, 
thereby creating a population inversion that enables laser action. Increasing the bias 
beyond this point will not improve the population inversion. In 
addition, because carrier supply is driven by diffusion, rather than by the 
external electric field, there is no benefit to be gained by increasing the bias 
voltage once conditions for lasing have been reached. This represents a 
significant difference in behavior that distinguishes the QCL from a conventional 
interband diode laser. 


8.5 Photon Emission Region 
8.5.1 Introduction 

In this section we will develop two important features of intersubband optical 
transitions. 

1. The quantum mechanical matrix element for optical transitions between 
subbands created by 1-dimensional quantum confinement is non-zero when 
the difference between their subband indices is an odd integer. 

2. The magnitude of the transition element is comparable to that for band-to- 
band recombination in p-n junction lasers. 


8.5.2 Intersubband Optical Transitions 

The probability of an optical transition between the states $\psi_2$ and $\psi_1$ is given by 
the square of the dipole matrix element. The dipole matrix element $M_{2\to1}$ between 
two electronic states in a quantum well is defined as: 

$$M_{2\to1} = \int_{-L/2}^{L/2} \psi_1^{*}(z)\, qz\, \psi_2(z)\, dz \qquad (8.18)$$

where L is the width of the well. 

The oscillator strength f of a dipole transition gives the relative strengths of 
the optical transitions for absorption or emission between two states: 

$$f_{21} = \frac{2 m_e}{q^2 \hbar^2}\,(E_2 - E_1)\,\langle M_{2\to1}\rangle^2 \qquad (8.19)$$

The transition rate is determined using Fermi’s golden rule. 

(8.20) 

The energy of an electron in a quantum well state is expressed: 

$$E_n = \frac{\hbar^2 k_z^2}{2m^*} + \frac{\hbar^2 k_t^2}{2m^*} \qquad (8.21)$$

where $k_z = n\pi/L$ and $k_t$ = the transverse wavevector. The energy difference 
between two states is: 

$$E_m - E_n = \frac{(m^2 - n^2)\,\pi^2 \hbar^2}{2L^2 m^*} \qquad (8.22)$$

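In Chap. 3 we determined the wavefunctions for the electronic states in an 
infinitely deep quantum well as follows: 

$$\psi_n(z) = \sqrt{\frac{2}{L}} \cos\left(\frac{n\pi z}{L}\right), \quad n \text{ odd} \qquad (3.13a)$$

$$\psi_n(z) = \sqrt{\frac{2}{L}} \sin\left(\frac{n\pi z}{L}\right), \quad n \text{ even} \qquad (3.13b)$$

The dipole matrix element between the states n = 1 and n = 2 is: 

$$M_{2\to1} = \frac{2q}{L} \int_{-L/2}^{L/2} \cos\left(\frac{\pi z}{L}\right) z\, \sin\left(\frac{2\pi z}{L}\right) dz \qquad (8.23)$$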


This integral can be solved in a straightforward if somewhat tedious manner, 
but it is more interesting to view the calculation of the dipole moment 
graphically. First we will consider the transition between the n = 2 quantum level 
and the n = 1 quantum level. The dipole matrix element has been calculated 
using (8.23) and is non-zero. In Fig. 8.9, we show why this must be the case. 

The solution of Schrödinger’s equation gives the electron wavefunction for 
each level. It is apparent from inspection of (3.13a and 3.13b) that each solution 
is orthogonal to all the others over the range $-L/2 \le z \le L/2$. In Fig. 8.9, for example, 
it can be seen that the n = 2 wavefunction is antisymmetric about the center of 
the well, whereas the n = 1 wavefunction is symmetric. Their product results in a 
function $\psi_1\psi_2$ that is as much negative as positive (i.e. antisymmetric), so that the 
integral of this function is zero when taken over the range of z. However, the 
calculation of the dipole matrix element involves multiplying the function $\psi_1\psi_2$ by 
qz. The result is a function that is positive across the entire range of z. Thus the 
dipole matrix element is non-zero. 

Fig. 8.9 Diagram of the functional elements in the calculation of the dipole matrix element between the 
n = 2 and n = 1 levels in a quantum well. The bold curve shows the function that is integrated to give the 
dipole matrix element 









We can compare this result with the dipole matrix element for the transition 
between the levels n = 3 and n = 1. The elements of this calculation are 
diagrammed in Fig. 8.10. 

Fig. 8.10 Diagram of the elements of the dipole matrix element between the n = 3 level and the n = 1 level 
of a quantum well. The dipole matrix element function, shown in heavier print, is antisymmetric with 
respect to the centre of the well and the value of its integral over the domain is zero 


The result of the calculation of the dipole matrix element for the transition 
$n_3 \to n_1$ is clearly equal to zero. That is, the transition is forbidden. The selection 
rule governing optical transitions in centro-symmetric quantum wells is the 
following: 

Optical transitions between confined states n and m in quantum wells are 
allowed if and only if their quantum numbers differ by an odd integer. 


We also note the important result that the polarization of the emitted or 
absorbed photons is parallel to the z-axis. 
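
The selection rule can be checked numerically by integrating the product of two well states with z. The sketch below assumes the infinitely deep well centred at z = 0, with cosine states for odd n and sine states for even n; it sets L = 1 and drops the charge q, so the result is the matrix element of z in units of L. The function names are illustrative.

```python
import math

def psi(n, z, L):
    """Infinite-well eigenfunction for a well of width L centred at z = 0."""
    if n % 2 == 1:
        return math.sqrt(2.0 / L) * math.cos(n * math.pi * z / L)
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * z / L)

def dipole_z(n, m, L=1.0, steps=2000):
    """Midpoint-rule estimate of the integral of psi_n * z * psi_m over the well."""
    dz = L / steps
    total = 0.0
    for i in range(steps):
        z = -L / 2 + (i + 0.5) * dz
        total += psi(n, z, L) * z * psi(m, z, L)
    return total * dz

for n, m in [(2, 1), (3, 1), (3, 2), (4, 1), (4, 2)]:
    print(f"n = {n}, m = {m}: |<z>| / L = {abs(dipole_z(n, m)):.4f}")
```

Pairs whose quantum numbers differ by an even integer give zero to within rounding error, while pairs that differ by an odd integer do not. The 2-to-1 element is the largest of those listed.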

The oscillator strength of the transition between the state n = 2 and n = 1 can 
be evaluated from (8.19): 

$$f_{21} = \frac{256}{27\pi^2} \approx 0.96 \qquad (8.24)$$

which was first derived by West and Eglash (1985). The optical transition 
rate is proportional to the oscillator strength. Since the oscillator strength for a 
band-to-band transition in GaAs is also close to unity, it follows that the 
transition rate for spontaneous photon emission between the n = 2 and n = 1 
subbands will be similar to that for a bulk band-to-band transition. In the case of 
GaAs this rate is approximately $10^{8}$ s$^{-1}$ (see ‘t Hooft et al. 1987). 
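
The near-unity oscillator strength can be verified for an infinitely deep well. The sketch below assumes the common definition f = (2m/ħ²)(E2 − E1)⟨z⟩², with the same mass used in the level spacing and in the prefactor, and the analytic value ⟨z⟩ = 16L/(9π²) for the 2-to-1 transition. The well width of 10 nm and the effective mass of 0.067 m0 are illustrative assumptions, not values from the text.

```python
import math

HBAR = 1.054571817e-34   # J s
M0 = 9.1093837015e-31    # free-electron mass, kg
EV = 1.602176634e-19     # J per eV

def level_spacing_eV(n_upper, n_lower, width_m, mass_kg):
    """E_upper - E_lower for an infinitely deep well, following (8.22)."""
    unit = (math.pi * HBAR) ** 2 / (2.0 * mass_kg * width_m ** 2)
    return (n_upper ** 2 - n_lower ** 2) * unit / EV

def oscillator_strength_21(width_m, mass_kg):
    """f = (2 m / hbar^2) (E2 - E1) z12^2, with z12 = 16 L / (9 pi^2)."""
    z12 = 16.0 * width_m / (9.0 * math.pi ** 2)
    delta_E = level_spacing_eV(2, 1, width_m, mass_kg) * EV
    return 2.0 * mass_kg * delta_E * z12 ** 2 / HBAR ** 2

width = 10e-9            # assumed well width
mass = 0.067 * M0        # assumed conduction-band effective mass
f21 = oscillator_strength_21(width, mass)
print(f"E2 - E1 = {level_spacing_eV(2, 1, width, mass):.3f} eV")
print(f"f21     = {f21:.4f}")
```

With this definition the well width and the mass cancel, so the oscillator strength is the same pure number for any infinitely deep well, while the transition energy scales as 1/L².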

Photon generation is diagrammed schematically in Fig. 8.11. Photon 
generation is composed of spontaneous emission ($\tau_{21}$) and stimulated emission 
($\tau_{stim}$), and thus 

$$\frac{1}{\tau_2} = \frac{1}{\tau_{21}} + \frac{1}{\tau_{stim}} \qquad (8.25)$$

Fig. 8.11 Photon generation region of a quantum cascade laser. Electrons are supplied to the n = 2 subband by 
diffusion. Electrons in the n = 2 band can give up their energy by transitions to the n = 1 subband, either by 
emitting a photon (spontaneous or stimulated emission) or a series of phonons (dashed line). Electrons in 
the n = 1 level can transition by resonant phonon scattering to the n' = 1 subband in the adjacent quantum 
well 

Reviewing the conditions for population inversion, we have: 

$$\tau_1 < \tau_2$$

$$\tau_3 < \tau_2 \qquad (8.26)$$

For the structure under consideration, we have shown that 

$\tau_1 \approx 10^{-13}$ s 
$\tau_2 = 10^{-8}$ s 
$\tau_3 = 5 \times 10^{-11}$ s (resonant optical phonon scattering) 

So the inequalities in (8.26) are satisfied. 

There is an additional condition, which is that the optical transition between 
n = 2 and n = 1 levels take place at a faster rate than other competing processes 
that can depopulate the n = 2 level. The most important of these is intersubband 
scattering by optical phonons between the two subbands. This means that $\tau_2$ 
should be less than the non-resonant optical phonon scattering time (Fig. 8.12): 

$\tau_2 < \tau_{op} = 10^{-9}$ s (non-resonant optical phonon scattering) 

We can see that this condition appears not to be satisfied. 

Fig. 8.12 The two main pathways for energy relaxation from the n = 2 to the n = 1 subband are photon 
generation and sequential optical phonon scattering. In a transition where a photon is emitted, the energy 
relaxation takes place in one step. Optical phonon scattering can take the electron from the n = 2 to n = 1 
subband, and sequential intraband optical phonon scattering in the n = 1 subband takes place until the 
electron reaches the energy minimum at $\mathbf{k} = (0, 0, 0)$ 


8.5.3 Spontaneous and Stimulated Emission 

The spontaneous emission rate = $1/\tau_{21}$ has been developed in Chap. 7. The ratio 
of spontaneous to stimulated emission near equilibrium is given in (7.24). 

For the quantum cascade structure under consideration, at 
room temperature, the spontaneous emission rate is about $10^{5}$ times greater than the 
stimulated emission rate. While the spontaneous emission rate remains constant, 
the stimulated emission rate is variable as demonstrated in (6.18): 

(7.19) 

where I is the intensity of the lasing mode. 

















As soon as current is present in the structure, photons are generated leading 
to stimulated emission, which increases the stimulated emission rate = $1/\tau_{stim}$. 

With no external electric field applied, the ratio of the number of carriers in 
the level n = 2 to that in level n = 1 is given by the Boltzmann relation: 

$\frac{n_2}{n_1} = e^{-\Delta E / k_B T} \approx 10^{-5}$ for an energy difference of 0.3 eV at room temperature. 
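
The equilibrium population ratio quoted above follows directly from the Boltzmann factor. A minimal sketch, using the room-temperature value k_B T = 0.026 eV from earlier in the chapter (the function name is illustrative):

```python
import math

def boltzmann_ratio(delta_E_eV, kT_eV=0.026):
    """Equilibrium population ratio n_upper / n_lower for a level spacing delta_E."""
    return math.exp(-delta_E_eV / kT_eV)

ratio = boltzmann_ratio(0.3)
print(f"n2/n1 = {ratio:.1e}")
```

At equilibrium the upper level is therefore almost empty, which is why population inversion has to be created by emptying the n = 1 level rather than by thermally filling the n = 2 level.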

By applying an electric field, this situation is dramatically changed. As soon 
as the applied bias is sufficient to empty carriers from the n = 1 level by resonant 
phonon scattering to level n' = 1, population inversion occurs. This non-equilibrium 
situation enables stimulated photon emission into the lasing mode. 
As the photon population in the lasing mode increases, so does the stimulated 
emission rate, which quickly becomes greater than the optical phonon 
intersubband non-resonant scattering rate. That is: 

$$\tau_{stim} < \tau_{op} \approx 10^{-9}\ \text{s (non-resonant)} \qquad (8.27)$$

Now all conditions for laser action are satisfied. 


8.6 Current-Voltage and Light-Current Relationship 

The quantum cascade intersubband device operates in a quite different way from 
the interband laser discussed in Chap. 7. In this device, population inversion is 
achieved by structural design that creates the condition for efficient evacuation 
of electrons from the lowest lying state in the photon generation region. In the 
interband laser, population inversion is obtained by injection of minority carriers 
into a recombination region. 

The quantum cascade intersubband device performance is a function of the 
applied electric field, which plays a critical role in aligning the quantum well 
levels in the structure. Once this electric field is reached, electrons can move 
efficiently in the structure, and each electron can, in principle, generate one 
photon in the photon generation region of every stage of the structure. In this 
situation, the current needed to create the threshold photon flux (where optical 
gain exceeds round-trip losses) is reduced by a factor equal to the number of cascade stages in 
the structure. Although this increases the operating voltage, the cascade feature 
enables laser operation at a lower current and reduced power dissipation. 

In the interband laser, electric field plays no role in determining laser 
operation. With the p-n junction in forward bias, the electric field in the laser 
device remains well below the threshold for band-to-band breakdown. The laser 
light output is proportional to the injected current, and ohmic heating limits the 
output optical power. 

Once lasing is achieved in a quantum cascade laser, increasing the electric 
field further will cause the energy levels to be misaligned, and optical output 
power will saturate and eventually decline. In the example studied above, the 
electric field needed to optimally align the energy levels is already close to the 
breakdown electric field where electrons in the valence band are ionized directly 
into the conduction band of the host material. 


8.6.1 Rate Equations 

Treating one period of the cascade laser structure, 


^" 2 = Z _ " 2 (_ L _) _ (8.28) 

where 7 ^ = * 2 ! + w + w + ' " includes spontaneous emission, plus all the 
scattering channels from state n = 2, and $8>» ( n - ~ n 0 represents electron 
depopulation from state n = 2 by stimulated emission . 
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where P is the fraction of the spontaneous emitted photons in the lasing mode. 
The quantity Qn-ton represents all the round-trip optical losses . 

In steady state, the time derivatives are all 0. 

Re-arranging (8.28): 

Solving (8.29) for the quantity 

In the sub-threshold regime, where $S \approx 0$, (8.33) can be simplified to 


An = J -^- / 1 - - n T rm (8.34) 

This is population inversion “by design”. It is evident that the evacuation 
time from the n = 2 to the n' = 1 level must be much shorter than the combined 
effect of optical phonon scattering and spontaneous emission in order to achieve 
population inversion. 

We can examine the condition for threshold using (8.30). 

For a well-designed quantum cascade structure, (8.37) can be simplified to: 


8.7 Interband Cascade Lasers 

The interband cascade laser (ICL) has a structure based on an assembly of 
quantum wells. Unlike the quantum cascade laser, the ICL is a bipolar device. 
Both electrons and holes are required for charge transport and photon emission. 
Laser action is achieved by recombination of conduction-band electrons with 
valence-band holes. In the conventional laser diode that we discussed in Chap. 7, 
electrons and holes are injected into the recombination region using a p-n 
junction. In the ICL, there is no p-n junction. Instead, holes are created by 
direct tunneling of electrons from the valence band to the conduction band, similar 
to the situation of the breakdown by tunneling of p-n junctions that was 
discussed in Chap. 3. However, instead of using an electric field to align the 
valence band and conduction band energies, the ICL exploits the unusual energy 
band alignment of the constituent materials, GaSb and InAs, that are used to 
make the quantum-well structure (Fig. 8.13). 

Fig. 8.13 GaSb, InAs and AlSb have the same crystal structure and nearly the same lattice parameter. In 
addition, they span a wide range of band gap energy. Thus they make an interesting system of materials for 
devices based on quantum confined energy states. However, their energy band alignment is quite unusual. 
GaSb/AlSb forms a type-I band alignment with the conduction and valence bands of GaSb lying inside 
those of AlSb, similar to GaAs/AlAs or GaInAs/InP. The pair AlSb/InAs shows a staggered (type-II) alignment. 
However, a unique situation arises with the alignment between GaSb and InAs. The valence band energy of 
electrons in GaSb lies above the conduction band of InAs. This enables the direct tunneling of valence-band 
electrons from GaSb to the conduction band of InAs. This is a key component of the Interband Cascade 
Laser 

Efficient laser action depends on the level of stimulated emission. In the case 
of the QCL, the degree of stimulated emission is determined by a competition 
between the rates of intersubband optical transitions and non-radiative losses 
through phonon emission and tunneling escape from the active region (8.25). In 
the ICL design, these particular non-radiative losses are dramatically reduced by 
causing the optical recombination to occur between the lowest-lying energy state 
in the conduction band and the highest-lying state in the valence band, just as in 
the case of a conventional quantum-well laser. Phonon scattering between 
intersubband states is no longer a concern. This results in a recombination 
lifetime for the ICL which is similar to that for a conventional quantum-well 
laser, and about 2 orders of magnitude larger than that for QCL devices. This 
longer recombination time reduces the electrical pumping rate needed to reach 
population inversion. This in turn lowers the laser threshold and increases the 
available output power. 

8.7.1 ICL Active Region Design 

GaSb ($E_g$ = 0.78 eV) and InAs ($E_g$ = 0.36 eV) are direct bandgap zinc-blende 
semiconductors which have nearly the same lattice parameter (Table 8.1). 
Extended heterostructures of these materials can be grown without introducing 
strain and strain-related defects. Both carrier confinement and optical 
confinement are assured by barriers of AlSb that surround the region of electron-hole 
recombination. Optical emission occurs between the conduction band 
minimum of the quantum-confined state in InAs and the valence band maximum 
of the quantum-confined state in GaSb (Fig. 8.14). 






Fig. 8.14 Active-region structure of the GaSb/InAs ICL. Quantum confinement raises the bottom of the 
conduction-band in InAs above the valence-band maximum in GaSb. (These are indicated by horizontal 
black line segments.) This creates an optical transition between quantum-confined states that is direct in k-space, but indirect in real space. 
Both energy and momentum are conserved. The optical matrix element depends on the spatial overlap of 
the wavefunctions in InAs and GaSb 


Table 8.1 Material parameters for semiconductors used in fabrication of interband cascade lasers (ICL) 

Material  Temp (K)  Egap (eV)      a0 (Å) lattice constant  VB offset (eV)  m_e    m_hh  m_lh 
GaSb      295       0.78           6.095                    0               0.041  0.4   0.05 
InAs      295       0.36           6.058                    -0.51           0.023  0.41  0.026 
AlSb      295       1.61 indirect  6.14                     -0.41           0.12   0.98 
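
The band alignment described for these materials can be read directly from Table 8.1. The sketch below places each conduction-band edge at the valence-band offset plus the band gap, with all energies measured from the GaSb valence-band maximum. It ignores quantum confinement and uses the indirect gap for AlSb; the dictionary layout and function name are illustrative.

```python
# Bulk band gaps and valence-band offsets (eV) taken from Table 8.1.
materials = {
    "GaSb": {"Eg": 0.78, "vb_offset": 0.00},
    "InAs": {"Eg": 0.36, "vb_offset": -0.51},
    "AlSb": {"Eg": 1.61, "vb_offset": -0.41},
}

def band_edges(name):
    """Return (valence edge, conduction edge) in eV relative to the GaSb valence band."""
    entry = materials[name]
    e_v = entry["vb_offset"]
    e_c = e_v + entry["Eg"]
    return e_v, e_c

for name in materials:
    e_v, e_c = band_edges(name)
    print(f"{name}: E_v = {e_v:+.2f} eV, E_c = {e_c:+.2f} eV")

# Energy by which the GaSb valence-band maximum lies above the bulk InAs
# conduction-band minimum (a positive value means the gaps do not overlap).
broken_gap = band_edges("GaSb")[0] - band_edges("InAs")[1]
print(f"GaSb E_v - InAs E_c = {broken_gap:.2f} eV")
```

With these numbers the bulk InAs conduction-band minimum lies 0.15 eV below the GaSb valence-band maximum, which is the alignment that allows valence-band electrons in GaSb to tunnel into the InAs conduction band.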



8.7.2 Hole Injection by Tunneling 

Optical recombination and laser action require the presence of holes in the 
valence band of GaSb and electrons in the conduction band of InAs, leading to 
population inversion. Consider first the introduction of holes in the valence band 
of GaSb. The hole injection structure is shown in Fig. 8.15. 









































[Figure 8.15 labels: electron tunneling from the valence band of GaSb to the conduction band of InAs; hole conduction; electron conduction]


Fig. 8.15 Hole injection into the valence band of GaSb is achieved by resonant tunneling. The horizontal
black line indicates the energy of the confined valence band in p-type material and the bottom of the
confined conduction band in n-type material. The conduction band of bulk InAs lies lower in energy than
the top of the valence band in GaSb. In the quantum-well structure the valence band state of GaSb
can be brought into near resonance with the conduction band state in InAs. By using an applied electric
field these two states can be made resonant, enabling tunneling


Design involves the determination of the widths of the layers of GaSb, InAs 
and AlSb that are required to give the desired progression of quantum confined 
energy states in the GaSb valence band and the InAs conduction band. On 
application of an external electric field, these levels can be made resonant. 
Tunneling of electrons takes place from the valence band of GaSb (where the 
density of occupied initial states is high) to the conduction band of InAs (where 
the density of occupied states is lower). This creates a hole population in the 
valence band of GaSb and an electron population in the conduction band of 
InAs. In an extended structure, the applied electric field will separate electrons 
and holes, leading to a population inversion condition in the active region
(Fig. 8.16).

Fig. 8.16 One complete period of an interband cascade laser structure. GaSb regions are shown in yellow,
with the height of the yellow region corresponding to the forbidden band separating the valence and 
conduction bands. Similarly, the solid blue regions correspond to AlSb, and the shaded region to InAs 
materials. The horizontal black line indicates the energy of the confined valence band in p-type material and 
the bottom of the confined conduction band in n-type material. Optical recombination leading to laser 
action occurs between electrons confined to InAs and confined hole states in GaSb (Lee et al. 1999) 
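
As a rough illustration of the layer-width design step described in Sect. 8.7.2, the sketch below estimates the ground-state confinement energy of an electron in an InAs well from the infinite-well formula E_n = (n π ħ)² / (2 m* L²). The infinite-barrier approximation, the neglect of band non-parabolicity, and the 5 nm well width are assumptions made here for illustration; only the InAs effective mass (0.023) is taken from Table 8.1.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M0 = 9.1093837015e-31    # electron rest mass, kg
EV = 1.602176634e-19     # joules per electron-volt


def infinite_well_energy_ev(width_nm, m_eff, n=1):
    """Energy (eV) of level n in an infinitely deep well of the given width."""
    width_m = width_nm * 1e-9
    energy_j = (n * math.pi * HBAR) ** 2 / (2.0 * m_eff * M0 * width_m ** 2)
    return energy_j / EV


# InAs electron effective mass from Table 8.1; the 5 nm width is illustrative.
e1_inas = infinite_well_energy_ev(5.0, 0.023)
e2_inas = infinite_well_energy_ev(5.0, 0.023, n=2)
print(f"E1 = {e1_inas:.3f} eV, E2 = {e2_inas:.3f} eV (infinite-well estimate)")
```

A real AlSb/InAs/GaSb design would use finite barrier heights, which lowers these levels, so the numbers above are best read as an upper bound on the confinement shift.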


8.8 Summary 

The quantum cascade laser is an excellent example of innovative device design
using quantum photonics. Quantum cascade laser devices have been
continuously developed following the first demonstration in 1994. Twenty years
later, these lasers can be produced in commercial volumes and are increasingly
used in mid-IR sensor applications (Razeghi et al. 2015).

The quantum cascade laser is a unipolar device based on electron transport
through a series of engineered quantum wells. Photon radiation occurs when
electrons make a quantum transition between two subband energy levels. The
successful operation of the quantum cascade laser is a balancing act between
population inversion maintained by resonant phonon evacuation of carriers from
the lower level and non-radiative depletion of the upper level population caused
by optical phonon scattering. The non-radiative relaxation time of electrons
between subbands with energy separation greater than the optical phonon energy
is much shorter (~1 ps) than the radiative recombination time (>1 ns), resulting
in a very low radiative efficiency (<10^-3) compared to interband diode lasers.

A resolution to this difficulty was proposed in 1995 by Yang. His design
retained the multiple-section laser structure by using tunneling between quantum
wells and introduced an optical gain region based on interband recombination.
This design, known as the interband cascade laser, eliminates a major difficulty
of the quantum cascade laser, which is the depletion of population inversion via
non-radiative, intersubband optical phonon scattering (Fig. 8.17).



[Figure 8.17 horizontal axis: wavelength (μm)]

Fig. 8.17 Threshold power density (expressed in kW cm^-2) versus operation wavelength for some mid-IR
semiconductor lasers. The interband cascade laser (ICL) requires a significantly reduced threshold power
density compared to that for the quantum cascade laser. This difference reflects the elimination of
non-radiative losses due to undesired optical phonon scattering. Diode lasers are color-coded in black (results
from University of Würzburg, Germany and Nanoplus, Germany). Quantum-cascade lasers are red (results
from Northwestern University, USA and Alpes Lasers SA, Switzerland). Interband-cascade lasers show
significantly lower threshold power densities in the 3-6 μm range (green results from Naval Research
Laboratory, USA; blue results from University of Würzburg, Germany) (Reprinted from the May 2013
edition of Laser Focus World with permission from L. Nähle, L. Hildebrandt, M. Kamp, and S. Höfling,
“Interband Cascade Lasers: ICLs open opportunities for mid-IR sensing” Laser Focus World, May 2013,
Copyright 2016 by PennWell)

The interband cascade laser, unlike the quantum cascade laser, is a bipolar 
device, containing both n-type and p-type regions, determined by the unusual 
type-II valence band alignment between InAs and GaSb, rather than by doping 
(Kim et al. 2008). 

Because of lower losses from non-radiative recombination, the threshold
power density in the ICL is lower by an order of magnitude compared to the QCL.
In remote sensing operations, where power consumption is an important parameter,
the ICL has superior performance relative to the QCL in the wavelength region
of 3-6 μm.



Exercises 


8.1 Evaluate the intersubband optical dipole matrix element between 2 levels i
and j, and show, following West and Eglash, that the general expression is:

\[ M_{ij} = qL\,\frac{8}{\pi^2}\,\frac{ij}{\left(i^2 - j^2\right)^2} \]


8.2 Quantum cascade lasers have been successfully designed using a wide
variety of structures. This direct-transition device, emitting at 4.6 μm,
was designed by Carlo Sirtori and co-workers (1996) (Fig. 8.18).



[Figure 8.18 region labels: evacuation, n-doping, injection, emission, resonant phonon transfer]

Fig. 8.18 Quantum cascade laser structure. Epitaxial layers are formed using Ga0.47In0.53As and
Al0.52In0.48As. Layer dimensions are given in Table 8.2


Table 8.2 Layer geometry for a direct transition quantum cascade laser operating at 4.6 μm

Region                    Al0.52In0.48As, m* = 0.08 (nm)  Ga0.47In0.53As, m* = 0.043 (nm)
Evacuation                2.7                             2.2
Evacuation                1.2                             2.1
Evacuation                2.0                             1.8
N doping                  1.8                             1.7
N doping                  2.0                             1.6
Injection                 2.2                             1.6
Injection                 2.4                             1.4
Injection barrier         6.8
Emission                                                  4.8
Evacuation barrier        2.8
Resonant phonon transfer                                  3.9



(a) Using the data given, calculate the position of the quantized levels in each 
quantum well. Determine the energy difference between the n = 1 and n = 2 
levels in the emission layer. 

(b) Calculate the miniband width of a series of 5 quantum wells having a width 
of 1.8 nm confined by 6 layers having width of 2.0 nm. 

(c) At zero applied bias voltage, what is the energy difference between the
n = 1 level of the quantum well in the emission layer and the n = 1 level of
the quantum well in the phonon transfer layer?

(d) What range of applied bias voltage would align the quantum levels in the 
evacuation and injection regions (within the miniband width that you 
calculated in 8.2b)? 

8.3 Design a direct tunneling structure for the injection section of the ICL 

shown in Fig. 8.16: that is, 5 barriers and 4 quantum wells. Assume that the 
materials used are AlSb for the barrier and InAs for the well regions. 

Assume that the operating electric field is & - ^ Vcm ', and that the 
emission energy is 0.37 eV. 

(a) Determine appropriate thicknesses for each of the barriers and wells. (Hint:
treat each of the quantum wells independently to determine the lowest
energy bound state in each of the 4 quantum wells. Then apply the electric
field and adjust the well parameters so the ground-state energies are
aligned energetically at this value of the electric field. Show that this
sequence will also give the desired emission energy.)












(b) Estimate the energy width of the lowest energy tunneling miniband for the 
integrated 5-barrier/4-well structure? (Hint: use the average of the well 
widths determined in 8.3a and compute the transmission coefficient for the 
complete structure, ignoring the electric field). 


8.4 The pseudopotential method introduced in Chap. 4 can be used to calculate 
the band structures of many III-V semiconductors. 

(a) Using the pseudopotential method, estimate the valence band offset 
between InAs and GaSb by calculating each bandstructure and comparing 
the position of the valence bands. 

(b) The accepted value for the valence band offset is -0.51 eV. The parameters 
of the pseudopotential calculation can be varied to yield this value of the 
valence band offset. Which parameters should be changed, and by how 
much?

Abstract

The electric field of a beam of photons propagating through a medium produces 
polarization by spatially deforming the charge distribution in the outer shell of 
electrons. The relationship between the polarization and the electric field is the 
susceptibility. The polarization of the charge distribution is opposed by a 
restoring force of each atom. The first order susceptibility represents the linear 
response of the medium to the electric field. When the medium lacks a center of 
inversion symmetry, it follows that the relationship between the electric field and 
the polarization is no longer linear. The non-linear optical response of materials 
is a field of study that is rich with phenomena and applications. Second
harmonic generation converts two photons having frequency $\omega$ into one
photon having frequency $2\omega$. Parametric amplification involves beams of two
different frequencies at the input: a stronger pump beam at $\omega_3$ and a weaker
signal beam at $\omega_1$. A special case of parametric amplification occurs when the
signal amplitude at $\omega_1$ is zero. Analysis by Maxwell’s equations shows that no
amplification occurs at the output. However, experiment shows that measurable
spontaneous fluorescence occurs. This fluorescence is called spontaneous
parametric down conversion. Its existence demonstrates the limitation of
Maxwell’s equations to the classical domain and beams of light composed of
many photons. Spontaneous parametric down conversion is widely used to
prepare single photons and pairs of entangled photons.


9.1 Introduction 

An applied electric field polarizes the medium in which it exists. The relation
between the two is written

\[ P = \epsilon_0 \chi E \tag{9.1} \]

where $\chi$ is the susceptibility, and $\epsilon_0$ is the permittivity of free space. The
susceptibility is in general a complex quantity and accounts for both dielectric
(loss-free) and conductive (lossy) behavior. In this chapter we will consider only
dielectric media, so that $\chi$ is a real parameter. In the case of an anisotropic
material, $\chi \to \chi_{ij}$ and relation (9.1) becomes:

\[ P_i = \epsilon_0 \chi_{ij} E_j \tag{9.2} \]

The electric field of a beam of photons propagating through a medium
produces polarization by spatially deforming the charge distribution in the outer
shell of electrons. The polarization is synchronous with the electric field and in
the case of an ideal dielectric, loss-less as well, so that the polarization is in
phase with the displacement $x(t)$ of the charge $q$, present at density $N$:

\[ P(t) = N q\, x(t) \tag{9.3} \]


9.2 Non-linear Response of Optical Materials 

The wavelength of an optical wave propagating through a solid dielectric
material is about $10^3$ times longer than the interatomic spacing. While the
deformation of the charge distribution takes place at optical frequencies
(~$10^{15}$ Hz), it is distributed over a significant distance compared to the electronic
cloud that surrounds an individual atom or crystalline unit cell.

The polarization of the charge distribution is opposed by a restoring force of
each atom, leading to a situation similar to that of the harmonic oscillator. In a
one-dimensional model we can express the potential:

\[ V(x) = \frac{m}{2} a x^2 + \frac{m}{3} b x^3 + \cdots \tag{9.4} \]

In a material with a center of inversion symmetry, $V(x) = V(-x)$, and thus
$b = 0$:

\[ V(x) = \frac{m}{2} a x^2 + \cdots \tag{9.5} \]

We recognize the first term in (9.5) as the potential for a harmonic oscillator.
We recognize that $a = \omega_0^2$, and the restoring force on the electron is:

\[ F(x) = -qE(x) = -\frac{d}{dx}V(x) = -m\omega_0^2 x, \qquad x = \frac{q}{m\omega_0^2}\,E(x) \tag{9.6} \]

However, if the material lacks a center of inversion symmetry then the
potential can take the form of (9.4). In this case the restoring force on the
electron is:

\[ F(x) = -\frac{d}{dx}V(x) = -m\omega_0^2 x - m b x^2 \tag{9.7} \]


It can be seen immediately from (9.7) that the relationship between the 
electric field and the polarization is no longer linear, and that for a given 
magnitude of electric field, the polarization will be less when the amplitude of 
the field is positive than when the amplitude of the field is negative. This 
situation is diagrammed schematically in Fig. 9.1. It can be seen that the 
sinusoidal oscillation of the electrical field is transformed by the non-linear 
potential into a polarization that no longer has a pure sinusoidal form (Yariv 
1975). 



Fig. 9.1 The polarization is the response of a medium to the presence of an electric field. When a photon 
passes through a material lacking inversion symmetry, the polarization becomes a non-linear function of the 
electric field. The sinusoidal oscillation of the electric field is distorted 









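The driving force comes from the electric field of the incident photon

\[ F(t) = qE(t) = \frac{qE}{2}\left(e^{i\omega t} + e^{-i\omega t}\right) \tag{9.8} \]

and the inertial force is

\[ F(t) = m\left(\frac{d^2}{dt^2}x(t) + \gamma\,\frac{d}{dt}x(t)\right) \tag{9.9} \]

The second term in (9.9) represents possible losses.

These three forces act together following the schematic diagram in Fig. 9.2.

[Figure 9.2 labels: driving force $F(t) = qE(t)$; restoring force $F(t) = -m\omega_0^2 x(t) - b m x^2(t)$; inertial force $F(t) = m\,d^2x/dt^2$]

Fig. 9.2 Schematic force diagram on an electronic state in the outer shell of a material due to the photon
electric field

Balancing the three forces gives the equation of motion:

\[ \frac{d^2}{dt^2}x(t) + \gamma\,\frac{d}{dt}x(t) + \omega_0^2 x(t) + b x^2(t) = \frac{qE}{2m}\left(e^{i\omega t} + e^{-i\omega t}\right) \tag{9.10} \]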

To proceed, we will assume that propagation takes place in an ideal
dielectric, so that $\gamma = 0$. For greater clarity in the presentation we will also ignore
the complex conjugate terms (for example $e^{-i\omega t}$).

We will analyse (9.10) by assuming a trial solution for the displacement
function $x(t)$:

\[ x(t) = \frac{1}{2}\left(k_1 e^{i\omega t} + k_2 e^{2i\omega t} + cc\right) \tag{9.11} \]

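This substitution transforms the differential equation into an algebraic
expression. Substituting this trial solution into (9.10) gives

\[ -\frac{1}{2}\left(k_1\omega^2 e^{i\omega t} + 4k_2\omega^2 e^{2i\omega t}\right) + \frac{\omega_0^2}{2}\left(k_1 e^{i\omega t} + k_2 e^{2i\omega t}\right) + \frac{b}{4}\left(k_1^2 e^{2i\omega t} + 2k_1k_2 e^{3i\omega t} + k_2^2 e^{4i\omega t}\right) = \frac{qE}{2m}\,e^{i\omega t} \tag{9.12} \]

This equation is valid for all t, and this requires that the coefficients of the
terms involving $e^{i\omega t}$ be equal on both sides of the equation, and the same for the
coefficients of $e^{2i\omega t}$.

Consider first the terms involving $e^{i\omega t}$:

\[ -\frac{1}{2}k_1\omega^2 + \frac{\omega_0^2}{2}k_1 = \frac{qE}{2m} \]

Solving for $k_1$:

\[ k_1 = \frac{qE}{m}\,\frac{1}{\omega_0^2 - \omega^2} \tag{9.13} \]

The polarization at $\omega$ is written as:

\[ P(\omega) = \frac{Nq}{2}\left(k_1 e^{i\omega t} + cc\right) = \frac{\epsilon_0}{2}\left[\chi(\omega) E e^{i\omega t} + cc\right] \tag{9.14} \]

where we have introduced the linear susceptibility $\chi(\omega)$.
Solving for $\chi(\omega)$,

\[ \chi(\omega) = \chi_L^{\omega} = \frac{Nq}{\epsilon_0 E}\,k_1 \tag{9.15} \]

It follows that:

\[ \chi_L^{\omega} = \frac{Nq}{\epsilon_0 E}\,\frac{qE}{m}\,\frac{1}{\omega_0^2 - \omega^2} = \frac{Nq^2}{m\epsilon_0}\,\frac{1}{\omega_0^2 - \omega^2}, \qquad \chi_L^{2\omega} = \frac{Nq^2}{m\epsilon_0}\,\frac{1}{\omega_0^2 - 4\omega^2} \tag{9.16} \]

Next consider the terms in $e^{2i\omega t}$:

\[ -2k_2\omega^2 + \frac{\omega_0^2}{2}k_2 + \frac{b}{4}k_1^2 = 0 \]

\[ k_2 = -\frac{k_1^2\, b}{2\left(\omega_0^2 - 4\omega^2\right)} = -\frac{q^2E^2\, b}{2m^2\left(\omega_0^2 - \omega^2\right)^2\left(\omega_0^2 - 4\omega^2\right)} \tag{9.17} \]

The non-linear susceptibility $d^{(2\omega)}$ is defined from the expression for the
polarization at $2\omega$ (Bloembergen 1996; Shen 2003).

\[ P^{(2\omega)} = \frac{Nq}{2}\left(k_2 e^{2i\omega t} + cc\right) \equiv \frac{1}{2}\left[d^{(2\omega)} E^2 e^{2i\omega t} + cc\right] \tag{9.18} \]

\[ d^{(2\omega)} = -\frac{bNq^3}{2m^2\left(\omega_0^2 - \omega^2\right)^2\left(\omega_0^2 - 4\omega^2\right)} = -\frac{b\,m\,\epsilon_0^3\left(\chi_L^{\omega}\right)^2\chi_L^{2\omega}}{2N^2q^3} \tag{9.19} \]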

In a 3-dimensional system, the non-linear susceptibility is not a constant, but
rather a 3rd order tensor. In general, the definition is

\[ P_i^{(2\omega)} = \sum_{j,k} d_{ijk}^{(2\omega)} E_j^{\omega} E_k^{\omega} \tag{9.20} \]

The tensor $d_{ijk}$ has 27 components. There is, however, no difference between
the results when the indices j and k are interchanged. That is,

\[ d_{ijk} = d_{ikj} \tag{9.21} \]

This reduces the number of independent components to 18. When the
propagation medium is a crystal, symmetry simplifies the situation even further,
so that in practice for interesting non-linear optical crystalline materials there are
only 1 or 2 independent elements (Boyd 2008). Thus, the subscripts kj can be
replaced by a single number.

xx -> 1
yy -> 2
zz -> 3
yz = zy -> 4
xz = zx -> 5
xy = yx -> 6                                                   (9.22)

The electric fields are combined into a six-component column vector

\[ \left(E_x^2,\; E_y^2,\; E_z^2,\; 2E_yE_z,\; 2E_xE_z,\; 2E_xE_y\right)^{T} \tag{9.23} \]

The polarization is calculated in the usual way:

\[ \begin{pmatrix} P_x \\ P_y \\ P_z \end{pmatrix} = \begin{pmatrix} d_{11} & d_{12} & d_{13} & d_{14} & d_{15} & d_{16} \\ d_{21} & d_{22} & d_{23} & d_{24} & d_{25} & d_{26} \\ d_{31} & d_{32} & d_{33} & d_{34} & d_{35} & d_{36} \end{pmatrix} \begin{pmatrix} E_x^2 \\ E_y^2 \\ E_z^2 \\ 2E_yE_z \\ 2E_xE_z \\ 2E_xE_y \end{pmatrix} \tag{9.24} \]


Example

KDP (potassium dihydrogen phosphate) belongs to symmetry class $\bar{4}2m$
(tetragonal: a 4-fold axis, two 2-fold axes and a mirror plane). The non-linear
susceptibility tensor for this class is:

\[ d_{ijk} = \begin{pmatrix} 0 & 0 & 0 & d_{14} & 0 & 0 \\ 0 & 0 & 0 & 0 & d_{14} & 0 \\ 0 & 0 & 0 & 0 & 0 & d_{36} \end{pmatrix} \tag{9.25} \]

For KDP (Gamdan Optics 2016),

$d_{14}$ = 0.35 pm V$^{-1}$
$d_{36}$ = 0.44 pm V$^{-1}$

GaAs is a cubic zinc-blende structure with symmetry class $\bar{4}3m$. The
non-linear susceptibility tensor for this crystal class is:

\[ d_{ijk} = \begin{pmatrix} 0 & 0 & 0 & d_{14} & 0 & 0 \\ 0 & 0 & 0 & 0 & d_{14} & 0 \\ 0 & 0 & 0 & 0 & 0 & d_{14} \end{pmatrix} \tag{9.26} \]

For GaAs, $d_{14}$ = 94 pm V$^{-1}$ (Skauli 2002).

β-Barium borate (BBO), Ba(BO2)2, trigonal class 3m:

\[ d_{ijk} = \begin{pmatrix} 0 & 0 & 0 & 0 & d_{15} & -d_{22} \\ -d_{22} & d_{22} & 0 & d_{15} & 0 & 0 \\ d_{31} & d_{31} & d_{33} & 0 & 0 & 0 \end{pmatrix} \tag{9.27} \]

For BBO (Coherent 2016),

$d_{22}$ = 7.22 pm V$^{-1}$
$d_{31}$ = 0.16 pm V$^{-1}$
<Aj 58 ^15 ^ d 3 1 

A non-linear optical response can occur in any material that lacks a center of 
inversion symmetry. Typically, non-linear effects are measured in crystals, and 
attention is paid to the relationship between the direction of photon propagation 
and the axes of symmetry of the crystalline structure (Wong 2002). However, a 
non-linear optical signal can occur in other circumstances. An important 
example concerns interfaces (Heinz 1991). In most cases, the interface between 
2 different materials does not have a center of inversion symmetry: for example, 
the interface between two liquids. When it is also the case that neither material 
forming the interface has a non-zero non-linear susceptibility , then non-linear 
effects can be used as a sensitive probe of the interface itself. This opportunity 
arises in the study of biological molecular domains that are supported in aqueous 
solution. As a result, non-linear optical spectroscopy is an important analytical 
tool in biochemical research. 
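
The contracted-notation product introduced in this section (a 3 by 6 matrix acting on the six-component field vector) can be sketched in a few lines. The example uses the zinc-blende tensor form with the GaAs value $d_{14}$ = 94 pm/V quoted above. The field components are in arbitrary units, and the helper names are hypothetical, introduced only for this illustration.

```python
def contracted_field_vector(ex, ey, ez):
    """Six-component vector (Ex^2, Ey^2, Ez^2, 2EyEz, 2ExEz, 2ExEy)."""
    return [ex * ex, ey * ey, ez * ez, 2 * ey * ez, 2 * ex * ez, 2 * ex * ey]


def second_order_polarization(d, field):
    """Multiply a 3x6 contracted d-matrix by the six-component field vector."""
    v = contracted_field_vector(*field)
    return [sum(row[n] * v[n] for n in range(6)) for row in d]


def zinc_blende_tensor(d14):
    """Contracted tensor for a zinc-blende crystal: only d14 = d25 = d36 is non-zero."""
    return [
        [0, 0, 0, d14, 0, 0],
        [0, 0, 0, 0, d14, 0],
        [0, 0, 0, 0, 0, d14],
    ]


d_gaas = zinc_blende_tensor(94.0)   # pm/V, value quoted in the text
p_xy = second_order_polarization(d_gaas, (1.0, 1.0, 0.0))  # field in the x-y plane
p_x = second_order_polarization(d_gaas, (1.0, 0.0, 0.0))   # field along x only
print(p_xy, p_x)
```

With this tensor form a field along a single cubic axis produces no second-order polarization, while a field with both x and y components drives a polarization along z.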











9.3 Electromagnetic Wave Propagation in Non-linear 
Materials 

Summarizing the results of Sect. 9.2, the polarization can be expressed in the
general case as:

\[ P_i = \epsilon_0\chi_{ij}E_j + 2d_{ijk}E_jE_k + \cdots \tag{9.28} \]

We will assume that each photon beam has a well-defined linear polarization
along a single Cartesian axis. This assumption
simplifies the analysis and corresponds to most experimental situations. Since
we will be working with only one relevant component of the susceptibility
tensors, we will leave out the dimensional subscripts for clarity in the
presentation.

Consider now the collinear propagation of two monochromatic photon
beams, of frequency $\omega_1$ and $\omega_2$. The total electric field is just
the arithmetic sum of the two components:

\[ E_T(t) = E_1(t) + E_2(t) = \frac{1}{2}\left[E_1 e^{i\omega_1 t} + E_2 e^{i\omega_2 t} + cc\right] \]

The 2nd-order non-linear polarization is:

\[ P_{NL}(t) = 2d\,E_T^2(t) = \frac{d}{2}\left[E_1 e^{i\omega_1 t} + E_2 e^{i\omega_2 t} + cc\right]^2 \tag{9.29} \]

Without making the full expansion of (9.29), it can be seen that the second
order polarization will contain terms with different frequency dependencies,
notably:

Second harmonic generation: $E_1^2 e^{2i\omega_1 t}$
Frequency sum: $|E_1||E_2| e^{i(\omega_1 + \omega_2)t}$
Frequency difference: $|E_1||E_2| e^{i(\omega_1 - \omega_2)t}$
And dc polarization = optical rectification: $E_1^2 + E_2^2$

For example, in the case of second harmonic generation, $\omega_1 = \omega_2$ and two
terms result, one with frequency $2\omega_1$, and a dc polarization. This is
diagrammed schematically in Fig. 9.3.
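
The frequency content of the second-order polarization can be illustrated numerically. The sketch below squares a two-tone field and projects the result onto integer harmonics over one common period. The frequencies 3 and 5 (arbitrary units), the unit amplitudes and the sample count are assumptions chosen so that every mixing product falls on an integer harmonic.

```python
import math


def fourier_amplitudes(samples, max_harmonic):
    """Amplitude of each integer harmonic of a signal sampled over one period."""
    n = len(samples)
    amplitudes = {}
    for h in range(max_harmonic + 1):
        c = sum(s * math.cos(2 * math.pi * h * i / n) for i, s in enumerate(samples))
        q = sum(s * math.sin(2 * math.pi * h * i / n) for i, s in enumerate(samples))
        scale = 1.0 / n if h == 0 else 2.0 / n
        amplitudes[h] = scale * math.hypot(c, q)
    return amplitudes


W1, W2 = 3, 5      # the two input frequencies (illustrative integers)
N_SAMPLES = 64     # samples over the common period 2*pi

field = [math.cos(W1 * 2 * math.pi * i / N_SAMPLES)
         + math.cos(W2 * 2 * math.pi * i / N_SAMPLES) for i in range(N_SAMPLES)]
polarization = [e * e for e in field]   # second-order response, proportional to E_T^2

amps = fourier_amplitudes(polarization, 12)
present = sorted(h for h, a in amps.items() if a > 1e-9)
print(present)
```

The harmonics that survive are the dc term, the difference frequency (2), the two second harmonics (6 and 10) and the sum frequency (8). The input frequencies 3 and 5 themselves are absent from the squared field.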



D 



Fig. 9.3 As a result of non-linear restoring forces, the optical polarization is a sum of the incident photon
frequency, its second harmonic and a dc polarization. The dc electric field that results from the dc
polarization is the manifestation of optical rectification


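We use Maxwell’s equations to describe the propagation of a photon wave in
a non-linear medium in the z-direction:

\[ \nabla^2 E(r,t) = \mu_0\sigma\,\frac{\partial}{\partial t}E(r,t) + \mu_0\epsilon_0\,\frac{\partial^2}{\partial t^2}E(r,t) + \mu_0\,\frac{\partial^2}{\partial t^2}\left[P_L(r,t) + P_{NL}(r,t)\right] \tag{9.31} \]

where we assume the absence of free charges (i.e. $\nabla \cdot E(r,t) = 0$).

Without any loss of generality, we will presume that a planar wave
propagates in the z-direction, so that $\partial/\partial x = \partial/\partial y = 0$.

Next consider 3 waves with distinct frequencies and wave-vectors
propagating simultaneously in the non-linear medium:

\[ E_{1i}(z,t) = \frac{1}{2}\left[E_{1i}(z)\,e^{i(\omega_1 t - k_1 z)} + cc\right] \]

\[ E_{2j}(z,t) = \frac{1}{2}\left[E_{2j}(z)\,e^{i(\omega_2 t - k_2 z)} + cc\right] \]

and

\[ E_{3k}(z,t) = \frac{1}{2}\left[E_{3k}(z)\,e^{i(\omega_3 t - k_3 z)} + cc\right] \tag{9.32} \]

In the case of propagation in a linear dielectric material, the polarization and
the electric field are related by the linear susceptibility:

\[ P_L(z,t) = \epsilon_0\chi_L E_{1i}(z,t) = \frac{\epsilon_0\chi_L}{2}\left[E_{1i}(z)\,e^{i(\omega_1 t - k_1 z)} + cc\right] \tag{9.33} \]

The non-linear polarization, by contrast, is a product that mixes the 3 wave
components. For example:

\[ P_{NL}^{\omega_1}(z,t) = \frac{1}{2}\,d_{ijk}\left[E_{3k}(z)E_{2j}^{*}(z)\,e^{i[(\omega_3 - \omega_2)t - (k_3 - k_2)z]} + cc\right] \tag{9.34} \]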

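Next we consider wave propagation in a non-linear medium. We will take the
wave to be linearly polarized. This simplifies the presentation of the algebra and
corresponds to the usual experimental conditions. Taking the first wave (at $\omega_1$)
of (9.32), we expand the LHS of (9.31):

\[ \frac{\partial^2}{\partial z^2}E_{1i}(z,t) = \frac{1}{2}\left[-k_1^2E_{1i}(z)\,e^{i(\omega_1 t - k_1 z)} - 2ik_1 e^{i(\omega_1 t - k_1 z)}\frac{d}{dz}E_{1i}(z) + e^{i(\omega_1 t - k_1 z)}\frac{d^2}{dz^2}E_{1i}(z) + cc\right] \tag{9.35} \]

We presume that the amplitude of the electric field varies slowly over the
interval corresponding to the wavelength of the wave, i.e.:

\[ \left|k_1\frac{d}{dz}E_{1i}(z)\right| \gg \left|\frac{d^2}{dz^2}E_{1i}(z)\right| \tag{9.36} \]

Neglecting the 2nd spatial derivative of $E_{1i}(z)$,

\[ \frac{\partial^2}{\partial z^2}E_{1i}(z,t) = -\frac{1}{2}\left[k_1^2E_{1i}(z)\,e^{i(\omega_1 t - k_1 z)} + 2ik_1 e^{i(\omega_1 t - k_1 z)}\frac{d}{dz}E_{1i}(z) + cc\right] \tag{9.37} \]

The RHS of (9.31) is written

\[ \mathrm{RHS} = -\frac{\omega_1^2\mu_0\epsilon_1}{2}\left[E_{1i}(z)\,e^{i(\omega_1 t - k_1 z)} + cc\right] + \mu_0\frac{\partial^2}{\partial t^2}P_{NL} \tag{9.38} \]

where $\epsilon_1 = \epsilon_0(1 + \chi_L)$ is the permittivity at $\omega_1$.
We will restrict our interest to dielectric materials which can be considered
lossless in the frequency region where they are transparent; that is, $\sigma = 0$.

Combining (9.37) and (9.38),

\[ -ik_1 e^{i(\omega_1 t - k_1 z)}\frac{d}{dz}E_{1i}(z) + cc = \mu_0\frac{\partial^2}{\partial t^2}P_{NL} \tag{9.39} \]

where we have used the relationship $k_1^2E_{1i} = \omega_1^2\mu_0\epsilon_1E_{1i}$ to simplify the
expression.

We require that photon energy be conserved; that is, $\omega_1 = \omega_3 - \omega_2$. Recall
that $k_1 = \omega_1\sqrt{\mu_0\epsilon_1}$. Substituting (9.34) into (9.39) then gives

\[ \frac{d}{dz}E_{1i}(z) = -\frac{i\omega_1}{2}\sqrt{\frac{\mu_0}{\epsilon_1}}\;d_{ijk}E_{3k}E_{2j}^{*}\,e^{-i(k_3 - k_2 - k_1)z} + cc \tag{9.42} \]

Likewise,

\[ \frac{d}{dz}E_{2j}^{*}(z) = +\frac{i\omega_2}{2}\sqrt{\frac{\mu_0}{\epsilon_2}}\;d_{ijk}E_{1i}E_{3k}^{*}\,e^{-i(k_1 - k_3 + k_2)z} + cc \tag{9.43} \]

\[ \frac{d}{dz}E_{3k}(z) = -\frac{i\omega_3}{2}\sqrt{\frac{\mu_0}{\epsilon_3}}\;d_{ijk}E_{1i}E_{2j}\,e^{-i(k_1 + k_2 - k_3)z} + cc \tag{9.44} \]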


The physical explanation behind 2nd harmonic generation is based on the 
non-linear response of valence electrons to polarization displacement imposed 
by the electro-magnetic field of a photon flux. This description is accurate, but 
gives an incomplete accounting of non-linear effects, because it does not include 
creation of photons from vacuum states. Equations (9.42)-(9.44) tell us that the 
non-linear optical tensor causes the electro-magnetic field of one ray to couple to 
fields of the two other rays. 

This mixing gives rise to other effects in addition to 2nd harmonic 
generation: frequency upconversion, frequency down conversion, parametric 
amplification, and spontaneous parametric down conversion, to name some 
important effects that can be harnessed using 2nd order non-linearities. 

9.3.1 Application to Second Harmonic Generation 

Equations (9.42)-(9.44) can be applied to the generation of frequency sum or
difference in non-linear materials. An important particular case is second
harmonic generation. For this situation, $\omega_1 = \omega_2$ and $\omega_3 = 2\omega_1$. Second harmonic
generation is thus an interaction between 2 rays of photons, diagrammed
schematically in Fig. 9.4.

Fig. 9.4 A schematic diagram of second harmonic generation in a nonlinear optical material. A photon
beam entering the material with frequency $\omega$ polarizes the valence electron distribution. Because of the
absence of inversion symmetry, which characterizes all non-linear materials, oscillation of the electron
distribution produces a second harmonic photonic signal at $2\omega$ and a dc polarization of the material


\[ \frac{d}{dz}E_{3k}(z) = -i\omega_1\sqrt{\frac{\mu_0}{\epsilon}}\;d_{ijk}E_{1i}E_{2j}\,e^{i(k_3 - k_1 - k_2)z} + cc \tag{9.45} \]

where we have substituted $\omega_1 = \omega_3/2$.

Integrating across the sample from 0 to L,

\[ E_{3k}(L) - 0 = -\omega_1\sqrt{\frac{\mu_0}{\epsilon}}\;d_{ijk}E_{1i}E_{2j}\,\frac{e^{i(k_3 - k_1 - k_2)L} - 1}{k_3 - k_1 - k_2} + cc \tag{9.46} \]

We recall from (5.43) that $k_3 - k_1 - k_2 = \Delta k$ is the vector sum of the momenta
of the photons involved in 2nd harmonic generation. In the semi-classical
approach being developed here, power can be transferred efficiently from the 1st
to the 2nd harmonic only if the waves propagate with the same phase velocity
and remain in phase with each other, i.e., that $\Delta k = 0$. We refer to $\Delta k$ as the
phase mismatch.

\[ E_{3k}(L) - 0 = -\omega_1\sqrt{\frac{\mu_0}{\epsilon}}\;d_{ijk}E_{1i}E_{2j}\,L\,\frac{e^{i\Delta kL} - 1}{\Delta kL} + cc \tag{9.47} \]


The intensity of the 2nd harmonic signal is

\[ I^{(2\omega)} \propto \left|E_{3k}(L)\right|^2 = \omega_1^2\,\frac{\mu_0}{\epsilon}\,d_{ijk}^2\left|E_{1i}\right|^2\left|E_{2j}\right|^2 L^2\,\frac{\sin^2(\Delta kL/2)}{(\Delta kL/2)^2} \tag{9.48} \]

In the general case, when $\Delta k \neq 0$, (9.48) is a periodic function of the path
length of propagation in the non-linear material. It oscillates between 0, where
all the optical power is present as the first harmonic, and some maximum where
power is shared between first harmonic and second harmonic emission. The
maximum value depends inversely on the degree of phase mismatch. This result
is diagrammed schematically in Fig. 9.5.

Fig. 9.5 When the first and second harmonic waves are not phase-matched, the output power is shared
between the first and second harmonic waves. The amount of second harmonic emission varies periodically
with propagation length in the non-linear material


It follows from (9.48) that useful second-harmonic emission could be obtained
if the non-linear material were cut to a precise thickness that depends on the
phase mismatch. This is not at all practical. However, if we can create
phase-matching between the first and second harmonic waves, so that $\Delta k = 0$, the
situation changes dramatically:

\[ \lim_{\Delta k \to 0}\frac{\sin^2(\Delta kL/2)}{(\Delta kL/2)^2} = 1, \qquad \text{so that the second-harmonic intensity grows as } L^2 \tag{9.49} \]


Under phase-matched conditions, the optical power in the second harmonic 
increases with the square of the propagation distance. At some point, our initial 
assumption of non-depletion of the first harmonic pump beam will no longer 
apply, and the conversion rate will diminish. This does not change the important 
result that nearly 100% of the pump power can in principle be converted to the 
2nd harmonic using phase-matched conditions. 
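
The length dependence discussed above can be made concrete with a short numerical sketch of the factor $L^2 \sin^2(\Delta kL/2)/(\Delta kL/2)^2$. The prefactors are omitted, and the mismatch value and unit system are illustrative assumptions. The quantity $\pi/|\Delta k|$, often called the coherence length, is the propagation distance at which the mismatched signal reaches its first maximum.

```python
import math


def shg_relative_intensity(length, delta_k):
    """Length- and mismatch-dependent factor L^2 * sin^2(x) / x^2 with x = dk * L / 2."""
    x = 0.5 * delta_k * length
    if abs(x) < 1e-12:          # phase-matched limit: the factor tends to L^2
        return length ** 2
    return (length * math.sin(x) / x) ** 2


def coherence_length(delta_k):
    """Distance to the first maximum of the mismatched second-harmonic signal."""
    return math.pi / abs(delta_k)


DELTA_K = 2.0                    # illustrative mismatch, inverse length units
lc = coherence_length(DELTA_K)

for length in (0.5 * lc, lc, 1.5 * lc, 2.0 * lc):
    mismatched = shg_relative_intensity(length, DELTA_K)
    matched = shg_relative_intensity(length, 0.0)
    print(f"L = {length:.3f}: mismatched = {mismatched:.4f}, matched = {matched:.4f}")
```

The mismatched signal never exceeds $(2/\Delta k)^2$ and returns to zero at twice the coherence length, while the phase-matched signal keeps growing as $L^2$ within the non-depletion approximation.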

Phase-matching can be achieved by using the natural birefringence of
non-linear optical materials. The phase-matched condition can be reduced simply to
the requirement that the index of refraction of the 1st harmonic be equal to that
of the 2nd harmonic wave. In the usual case, the index of refraction is a
decreasing function of wavelength.¹ Thus it would not be possible to achieve
phase-matching. However, birefringent materials possess two indices of
refraction: ordinary, $n_o$, and extraordinary, $n_e$. In these materials, it is possible to
achieve phase-matching when the 1st and 2nd harmonic waves are of different
types: one ordinary and the other extraordinary. The effective index of refraction
$n_e(\theta)$ experienced by the extraordinary wave in a birefringent crystal depends
on the angle $\theta$ between its propagation direction and the optic axis of the crystal. A
simple geometric formula relates $n_e(\theta)$ to $n_o$ and $n_e$:

\[ \frac{1}{n_e^2(\theta)} = \frac{\cos^2(\theta)}{n_o^2} + \frac{\sin^2(\theta)}{n_e^2} \tag{9.50} \]


If $n_e^{2\omega} < n_o^{\omega}$, then there will be an angle $\theta_m$, the phase-matching angle, where
$n_e^{2\omega}(\theta_m) = n_o^{\omega}$. If the 1st harmonic is launched along the direction $\theta_m$ as an
ordinary ray (that is, polarized perpendicular to the plane formed by the optic
axis and the direction of propagation), the 2nd harmonic component will be
generated having the same index of refraction and direction of propagation, but as
an extraordinary ray (that is, polarized in the plane formed by the optic axis and
the direction of propagation, see Fig. 5.3). Thus, the phase-matched 2nd
harmonic ray is collinear with the 1st harmonic. We call this type-I
phase-matching. Because this non-linear interaction produces only one additional ray,
type-I phase-matching is the only possible solution, due to conservation of
momentum. However, as we shall see shortly, frequency up conversion and
frequency down conversion produce 2 additional rays. This enables the
generation of each optical ray at an angle to the pumping beam provided that
overall photon momentum is conserved in 3 dimensions. This is called type-II
phase matching. It has the advantage that the 3 optical waves are separated
spatially, because they propagate in different directions.

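Substituting $n_o^{\omega}$ for $n_e^{2\omega}(\theta_m)$ in (9.50),

\[ \frac{1}{\left(n_o^{\omega}\right)^2} = \frac{\cos^2(\theta_m)}{\left(n_o^{2\omega}\right)^2} + \frac{\sin^2(\theta_m)}{\left(n_e^{2\omega}\right)^2} \tag{9.51} \]

\[ \sin^2(\theta_m) = \frac{\left(n_o^{\omega}\right)^{-2} - \left(n_o^{2\omega}\right)^{-2}}{\left(n_e^{2\omega}\right)^{-2} - \left(n_o^{2\omega}\right)^{-2}} \tag{9.52} \]

allows determination of the angle of propagation relative to the optic axis to
achieve the phase-matching condition.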
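
A minimal numerical sketch of this angle calculation follows. It solves the index-ellipsoid relation for the angle at which the extraordinary index at $2\omega$ equals the ordinary index at $\omega$. The three refractive indices are illustrative values assumed here for a negative uniaxial crystal. They are not taken from the text and should be replaced by measured dispersion data for a real design.

```python
import math


def extraordinary_index(theta, n_o, n_e):
    """Effective extraordinary index at angle theta (radians) to the optic axis."""
    inv_sq = math.cos(theta) ** 2 / n_o ** 2 + math.sin(theta) ** 2 / n_e ** 2
    return 1.0 / math.sqrt(inv_sq)


def phase_matching_angle(n_o_w, n_o_2w, n_e_2w):
    """Angle (radians) at which n_e(2w, theta) equals n_o(w)."""
    ratio = (n_o_w ** -2 - n_o_2w ** -2) / (n_e_2w ** -2 - n_o_2w ** -2)
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("no phase-matching angle exists for these indices")
    return math.asin(math.sqrt(ratio))


# Assumed, illustrative indices: ordinary at w, ordinary and extraordinary at 2w.
N_O_W, N_O_2W, N_E_2W = 1.506, 1.534, 1.487

theta_m = phase_matching_angle(N_O_W, N_O_2W, N_E_2W)
n_check = extraordinary_index(theta_m, N_O_2W, N_E_2W)
print(f"theta_m = {math.degrees(theta_m):.1f} deg, n_e(2w, theta_m) = {n_check:.4f}")
```

The check value reproduces the ordinary index at the fundamental frequency, which is the phase-matching condition. The function raises an error when the birefringence is too small to compensate the dispersion.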


9.3.2 Application to Parametric Amplification 









Parametric amplification is the general case of non-linear interaction. Three rays
are involved where $\omega_3 = \omega_1 + \omega_2$ (Fig. 9.6).

Fig. 9.6 Schematic diagram of parametric amplification involving 3 photon beams. The pump beam with
frequency $\omega_3$ and a less intense signal beam at $\omega_1$ enter the non-linear crystal. The signal beam is
amplified by energy transfer from the pump to the signal at the signal frequency. This requires the creation
of a third beam at frequency $\omega_2 = \omega_3 - \omega_1$ so that total photon energy is conserved. The ray at $\omega_2$ is called the idler.
The non-linear optical tensor implements the mixing among the 3 rays


Starting with (9.42)-(9.44), we make a change of variable that simplifies the
expressions:

\[ F_i = \sqrt{\frac{n_i}{\omega_i}}\,E_i \quad \text{for } i = 1, 2, 3 \tag{9.53} \]

$|F_i|^2$ is thus proportional to the photon flux.

The equations for the electric fields are transformed to:

\[ \frac{d}{dz}F_{1i}(z) = -\frac{i}{2}\,\kappa\,d_{ijk}F_{3k}F_{2j}^{*}\,e^{-i\Delta kz} \tag{9.54} \]

\[ \frac{d}{dz}F_{2j}^{*}(z) = \frac{i}{2}\,\kappa\,d_{ijk}F_{3k}^{*}F_{1i}\,e^{i\Delta kz} \tag{9.55} \]

\[ \frac{d}{dz}F_{3k}(z) = -\frac{i}{2}\,\kappa\,d_{ijk}F_{1i}F_{2j}\,e^{i\Delta kz} \tag{9.56} \]

This change of variable allows us to use a single coupling parameter $\kappa$:

\[ \kappa = \sqrt{\frac{\mu_0}{\epsilon_0}\,\frac{\omega_1\omega_2\omega_3}{n_1n_2n_3}} \]

and

\[ \Delta k = k_3 - (k_1 + k_2) \]

In the following analysis, we impose conservation of energy:

\[ \omega_3 = \omega_1 + \omega_2 \tag{9.57} \]

and conservation of momentum:

\[ \Delta k = 0 \tag{9.58} \]

We will assume type-I collinear phase matching, so that all rays propagate in
the z-direction. In this way, we can replace the tensor product by a simple
algebraic product, with $d_{ijk}$ replaced by its effective value $d'$ at the appropriate
angle relative to the optic axis for phase-matching.


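\[ \frac{d}{dz}F_{1i}(z) = -\frac{i}{2}\,\kappa d' F_{3k}F_{2j}^{*} \tag{9.59} \]

\[ \frac{d}{dz}F_{2j}^{*}(z) = \frac{i}{2}\,\kappa d' F_{3k}^{*}F_{1i} \tag{9.60} \]

\[ \frac{d}{dz}F_{3k}(z) = -\frac{i}{2}\,\kappa d' F_{1i}F_{2j} \tag{9.61} \]

Next, we assume non-depletion of the pump beam so that
$F_{3k}(z) = F_{3k}(0)$ = constant, taken to be real. This leaves 2 equations only

\[ \frac{d}{dz}F_{1i}(z) = -\frac{i}{2}\,\kappa d' F_{3k}(0)F_{2j}^{*}, \qquad \frac{d}{dz}F_{2j}^{*}(z) = \frac{i}{2}\,\kappa d' F_{3k}(0)F_{1i} \tag{9.62} \]

Simplify by defining a new parameter $\gamma$

\[ \gamma = \kappa F_{3k}(0) \tag{9.63} \]

\[ \frac{d}{dz}F_{1i}(z) = -\frac{i}{2}\,\gamma d' F_{2j}^{*}(z) \tag{9.64} \]

\[ \frac{d}{dz}F_{2j}^{*}(z) = \frac{i}{2}\,\gamma d' F_{1i}(z) \tag{9.65} \]

Solving for $F_{1i}(z)$:

\[ F_{1i}(z) = \frac{2}{i\gamma d'}\,\frac{d}{dz}F_{2j}^{*}(z) \tag{9.66} \]

\[ \frac{d^2}{dz^2}F_{2j}^{*}(z) = \frac{\gamma^2 d'^2}{4}\,F_{2j}^{*}(z) \tag{9.67} \]
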
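We choose as a trial solution: $F_{2j}^{*}(z) = C\cosh\left(\frac{\gamma d'}{2}z\right) + D\sinh\left(\frac{\gamma d'}{2}z\right)$

For initial conditions we choose ray 1 to be the “signal” wave and ray 2 to be
the “idler” wave. At z = 0 the idler wave intensity is zero. The nonlinear
coupling builds up the idler wave intensity by generating an electromagnetic
wave with frequency $\omega_2 = \omega_3 - \omega_1$. These initial conditions mean that $C = 0$ and

\[ F_{2j}^{*}(z) = D\sinh\left(\frac{\gamma d'}{2}z\right) \tag{9.68} \]

Substituting this result in (9.66),

\[ F_{1i}(z) = -iD\cosh\left(\frac{\gamma d'}{2}z\right), \qquad \text{so that } D = iF_{1i}(0) \tag{9.69} \]

\[ F_{2j}(z) = -iF_{1i}^{*}(0)\sinh\left(\frac{\gamma d'}{2}z\right) \tag{9.70} \]

Finally we restore the original parameters

\[ E_{1i}(z) = E_{1i}(0)\cosh\left(\frac{\gamma d'}{2}z\right) \tag{9.71} \]

\[ E_{2j}(z) = -i\sqrt{\frac{n_1\omega_2}{n_2\omega_1}}\;E_{1i}^{*}(0)\sinh\left(\frac{\gamma d'}{2}z\right) \tag{9.72} \]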

These results show that parametric amplification occurs for both the signal
and idler rays, fuelled by the electric field of the pump. A more precise
analysis, allowing for pump depletion, would show how this occurs as the three
rays propagate through the non-linear crystal. The dependence of the mixing on
the non-linear optical coefficient confirms the non-linear optical origin of
this effect. The ray at $\omega_2$ grows from zero intensity at $z = 0$.
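
The growth of the signal and idler can be checked numerically. The following
Python sketch integrates the two undepleted-pump equations with a hand-written
fourth-order Runge-Kutta step and compares the result with the hyperbolic
cosine and sine solutions. It assumes perfect phase matching and a real pump
amplitude. The function name `integrate_opa`, the lumped gain `g` (standing for
the product of $\gamma$ and $d'$) and the dimensionless input values are
choices made for this illustration only.

```python
import numpy as np

def integrate_opa(f1_0, g, length, steps=2000):
    """Integrate dF1/dz = -i(g/2) F2c and dF2c/dz = +i(g/2) F1 with RK4.

    F2c stands for the conjugate idler amplitude F2*, and g is an assumed
    lumped gain (gamma * d'). The idler starts at zero amplitude.
    """
    def rhs(y):
        f1, f2c = y
        return np.array([-0.5j * g * f2c, 0.5j * g * f1])

    y = np.array([f1_0, 0.0], dtype=complex)
    h = length / steps
    for _ in range(steps):
        k1 = rhs(y)
        k2 = rhs(y + 0.5 * h * k1)
        k3 = rhs(y + 0.5 * h * k2)
        k4 = rhs(y + h * k3)
        y = y + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
    return y

# Illustrative, dimensionless values (assumptions, not material data)
g, length, f1_0 = 1.5, 2.0, 1.0
f1_num, f2c_num = integrate_opa(f1_0, g, length)

# Closed-form solutions for the same initial conditions
f1_exact = f1_0 * np.cosh(0.5 * g * length)
f2c_exact = 1j * f1_0 * np.sinh(0.5 * g * length)

print("signal error:", abs(f1_num - f1_exact))
print("idler error: ", abs(f2c_num - f2c_exact))
# |F1|^2 - |F2|^2 keeps its input value: one idler photon per added signal photon
print("flux difference:", abs(f1_num) ** 2 - abs(f2c_num) ** 2)
```

The conserved difference between the two squared amplitudes reflects the fact
that signal and idler photons are created in pairs.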

It can be seen from (9.71) and (9.62) that the mixing effect is strictly zero if
the amplitudes of both input fields are zero. Experiment, on the contrary, shows
the presence of spontaneous down conversion fluorescence under these conditions
(Ling 2008). Thus, the classical electromagnetic analysis is not capable of
describing the observed physics. In Chap. 5 we discussed degenerate spontaneous
down conversion, where $\omega_1 = \omega_2$, and its practical importance as a
procedure for generation of entangled photon pairs. Similar to the case of
spontaneous emission, photons from the vacuum state with states that permit
energy and momentum conservation stimulate parametric down conversion.
Straightforward analysis (see Sect. 5.5) shows that two photons, each with
frequency $\omega$, can be created from the vacuum state, while simultaneously a
single photon of frequency $2\omega$ is annihilated. The converted intensity
increases linearly with that of the pump beam, and the conversion rate is very
small, on the order of one event per $10^{12}$ pump photons.
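
To give a feeling for these numbers, the short Python sketch below converts a
pump power into a photon rate and applies the conversion ratio of one event per
$10^{12}$ pump photons quoted above. The pump power of 1 mW and the wavelength
of 405 nm are assumed values chosen only for illustration, and the function
names are hypothetical.

```python
H = 6.62607015e-34   # Planck constant in J s (exact SI value)
C = 2.99792458e8     # speed of light in m/s (exact SI value)

def pump_photon_rate(power_w, wavelength_m):
    """Photons per second in a beam of given power and vacuum wavelength."""
    return power_w * wavelength_m / (H * C)

def pair_rate(power_w, wavelength_m, events_per_pump_photon=1e-12):
    """Down-conversion events per second, linear in the pump power."""
    return events_per_pump_photon * pump_photon_rate(power_w, wavelength_m)

# Assumed example: 1 mW pump at 405 nm
rate = pair_rate(1e-3, 405e-9)
print(f"pump photons per second: {pump_photon_rate(1e-3, 405e-9):.3e}")
print(f"pair events per second:  {rate:.0f}")
```

Under these assumptions the pump delivers about $2 \times 10^{15}$ photons per
second, so the expected pair rate is of order a few thousand per second, and it
doubles when the pump power doubles.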







9.4 Summary 

The non-linear optical response of materials is a field of study that is rich with
phenomena and applications. Using Maxwell’s equations we have focussed on
the principles of two important phenomena that are used in studies of quantum
photonic behavior: second-harmonic generation and parametric amplification.

Second-harmonic generation occurs as a result of the polarization of a crystal
by the electric field of a traversing wave. If the material lacks a center of
inversion symmetry in its structure, second harmonic distortion of the
electromagnetic wave will occur. The incident pump wave and the second
harmonic signal co-propagate in the non-linear material. In the case of
birefringent non-linear optical crystals, there will be a specific direction where
the propagation velocities of the pump and the second harmonic are the same.
Propagation along this direction causes the pump and the second harmonic to
remain in phase, and corresponds to the phase-matching condition. At lower
conversion rates, when the intensity of the pump beam remains largely
undepleted, the intensity of the second harmonic component is proportional to
the square of the pump beam intensity. This shows that two pump photons are
required to produce one second harmonic photon. Under optimized conditions,
nearly all of the pump light power can be converted to the second harmonic.
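
The two regimes described in this paragraph can be illustrated with a short
Python sketch. It uses the standard plane-wave result for perfectly
phase-matched second-harmonic generation, in which the conversion efficiency is
$\tanh^2(\Gamma L)$ with $(\Gamma L)^2$ proportional to the pump intensity.
That formula is brought in here as an assumption rather than derived in this
chapter, and the constant `coupling`, which lumps together the non-linear
coefficient, the crystal length and the refractive indices, is hypothetical.

```python
import numpy as np

def shg_efficiency(pump_intensity, coupling=1.0):
    """Fraction of pump power converted, assuming perfect phase matching.

    (Gamma * L)^2 is modelled as coupling * pump_intensity, where coupling is
    a hypothetical lumped constant in arbitrary units.
    """
    return np.tanh(np.sqrt(coupling * pump_intensity)) ** 2

# Low conversion: second-harmonic intensity scales as the pump intensity squared
low = np.array([1e-4, 2e-4])
second_harmonic = low * shg_efficiency(low)
print("ratio for doubled pump:", second_harmonic[1] / second_harmonic[0])

# Strong pumping: the efficiency saturates towards complete conversion
print("efficiency at high intensity:", shg_efficiency(25.0))
```

Doubling a weak pump multiplies the second-harmonic output by very nearly four,
while at high intensity the efficiency approaches unity.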

Parametric amplification is a more general technique for mixing photon
energies. In its simplest form, a pump beam and a signal beam enter the NLO
material. At the exit, the signal beam has been amplified, and a new optical wave
appears with frequency equal to the difference between that of the pump and the
signal, conserving photon energy. In contrast to the case of second harmonic
generation, the amplification of the signal depends linearly on the intensity
of the pump beam. A particular case of parametric amplification is that where the
amplitude of the signal beam is zero. Maxwell’s equations show in this case that
there is no amplified beam, and that the “idler” is indistinguishable from the
pump. Thus there should be no effect. This contradicts experiment, which shows
clearly that photons are emitted satisfying energy conservation. This effect is
called spontaneous parametric down-conversion, by analogy with the spontaneous
emission that occurs in light-emitting devices like lasers and LEDs. A photon
created from vacuum fluctuations at the input port of the non-linear material can
be amplified by parametric amplification. This results in the appearance of both
signal and idler photons at the output. The analysis of this process must be
carried out using quantized field theory, as we have done in Chap. 5. The result
proves that the intensity of the spontaneously generated photons is linearly
proportional to the pump power, in agreement with experiment, which is echoed
by the classical analysis using Maxwell’s equations in the regime where these
can be applied.

Second-harmonic generation converts two photons with frequency $\omega$ to
one photon having frequency $2\omega$. Spontaneous parametric down conversion
can be used to create 2 photons having frequency $\omega$ from one photon having
frequency $2\omega$. Although both processes are non-linear optical effects, it
should be quite clear that one is not at all the inverse of the other, having
their origins in completely different physical effects.


Exercises 


9.1 In the case where propagation in the optical medium is lossy, show that the
linear optical susceptibility is given by:

$$\chi(\omega) = \frac{Nq^2}{m\varepsilon_0}\,\frac{1}{\omega_0^2 - \omega^2 + i\Gamma\omega}$$


9.2 Using published data for KDP, calculate the phase matching condition for 
second-harmonic generation of a pump beam having a vacuum wavelength 
of 800 nm. 


9.3 Refer to (9.42)-(9.44) (which explain how the non-linear optical tensor
causes the electro-magnetic field of one ray to couple to fields of the two
other rays). Examine the use of a nonlinear optical constant that varies
periodically:

$$d_{ijk} \to \frac{d_{ijk}}{2}\left(e^{iKz} + e^{-iKz}\right)$$

(a) Show that there is a specific periodicity ($K = \Delta k$) that causes the
exponent in (9.32)-(9.44) to separate into phase-matched and
phase-mismatched terms.


(b) Show that averaging your result over a length of several periods will 
cause the phase-mismatched terms to average to zero, while the 
phase-matched terms accumulate. 








This method of partial-phase matching, using an externally-imposed periodic 
variation in the non-linear coefficient, is called quasi phase-matching. It is often 
employed in conjunction with non-linear interactions in optical waveguides, 
where the interaction distance can be quite long compared to that of the periodic 
variation. 
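
The averaging argument of Exercise 9.3 can be explored numerically. The Python
sketch below evaluates the mixing integral of the non-linear coefficient times
the phase factor $e^{i\Delta k z}$ over ten full periods, once for a uniform
coefficient and once for a coefficient modulated as $\cos(Kz)$ with
$K = \Delta k$. The cosine form of the modulation, the function name
`mixing_integral` and the numerical values in arbitrary units are assumptions
made for this illustration.

```python
import numpy as np

def mixing_integral(delta_k, length, modulation_k=None, samples=20000):
    """Magnitude of the integral of d(z) * exp(i * delta_k * z) over [0, length].

    d(z) = 1 for a uniform medium, or cos(modulation_k * z) when the
    non-linear coefficient is periodically modulated (assumed form).
    """
    dz = length / samples
    z = (np.arange(samples) + 0.5) * dz   # midpoint sampling
    if modulation_k is None:
        d = np.ones_like(z)
    else:
        d = np.cos(modulation_k * z)
    return abs(np.sum(d * np.exp(1j * delta_k * z)) * dz)

delta_k = 2 * np.pi   # illustrative phase mismatch; one period has length 1
length = 10.0         # ten full periods

uniform = mixing_integral(delta_k, length)
modulated = mixing_integral(delta_k, length, modulation_k=delta_k)
print("uniform coefficient:  ", uniform)
print("modulated coefficient:", modulated)
```

With the uniform coefficient the contributions cancel over each period, whereas
with the matched modulation the integral grows as half the interaction length,
because the product $\cos(\Delta k z)\,e^{i\Delta k z}$ contains a constant
term of one half plus a term oscillating at $2\Delta k$ that averages to zero.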